I'm a developer and PhD candidate at The University of Auckland researching "Natural Language Processing in Clinical Trials". I work across a variety of fields to solve complex problems with technology. While my main focus is on AI and machine learning, I also take on projects in data science, software development, web development, and blockchain, depending on what interests me at the time or what the job requires.
I enjoy exploring different areas of technology, applying AI to create innovative solutions and combining it with practical development skills to bring ideas to life, whether that means building predictive models, diving into data analysis, or creating user-friendly applications.
Build robust and scalable applications, handling both front-end and back-end development for web applications, as well as developing software tools and systems. I’ve previously developed lending systems, integrated chatbots, worked on blockchain tools, and created interactive data dashboards to assist clients in gaining insights from complex data.
Offer services ranging from data analysis to data mining, data manipulation, and web scraping. I help clients extract insights from large datasets using statistical methods and algorithms. My expertise also includes machine learning, where I implement predictive models that allow clients to perform classification and forecast future trends. I further extend my work into AI, creating chatbots, AI agents, and even image detection systems.
Design and deliver introductory courses on AI and Python programming in Hong Kong, specifically targeted at primary and secondary school students. I also coordinate with fellow instructors to continuously improve the course materials and teaching methods, ensuring the best learning experience for students.
References available upon request.
Developed a high-throughput indexer to collect real-time data from Solana and Ethereum and aggregate it for specific use cases. Led the design and implementation of a message-broker system (e.g., RabbitMQ) to efficiently handle user operations and maintain system performance. Contributed to full-stack web development and oversaw Docker containerization for the entire infrastructure. Managed database design for performance and efficiency, ensuring smooth data flow and scalability. As lead developer, played a key role in multiple aspects of the project, collaborating across teams to ensure the system's overall efficiency and reliability. The startup reached an approximate $5 million USD market cap at its peak.
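The core idea of the broker-based design, decoupling the real-time indexer from downstream processing so ingestion never blocks, can be sketched in a few lines. This is purely illustrative: the production system used a dedicated message broker such as RabbitMQ, while the sketch below stands in an in-process, thread-safe queue, and all names (`indexer`, `worker`, the event fields) are invented for the example.

```python
import queue
import threading

# Thread-safe queue standing in for the message broker.
events = queue.Queue()
processed = []

def indexer(n):
    # Stand-in for the real-time chain indexer: push raw events,
    # then a sentinel to signal the end of the stream.
    for i in range(n):
        events.put({"slot": i, "payload": f"tx-{i}"})
    events.put(None)

def worker():
    # Stand-in for the consumer that aggregates events downstream;
    # it drains the queue independently of the producer's pace.
    while (event := events.get()) is not None:
        processed.append(event["payload"])

t = threading.Thread(target=worker)
t.start()
indexer(3)
t.join()
print(processed)
```

The same shape carries over to a real broker: the producer publishes to an exchange instead of a local queue, and the consumer acknowledges messages as it processes them.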
References available upon request.
Clinical trials (CTs) enable high-quality decisions that benefit healthcare systems, but they are often expensive and prone to failure. With the large amount of clinical data accumulated over the years, more modern techniques can be applied to draw inferences. Natural language processing (NLP) techniques are now widely used in areas such as grammar induction and text-to-speech transformation. However, NLP is not yet mature in the CT domain. It would be beneficial to apply NLP techniques to automate tasks such as extracting important information from clinical reports and optimizing trial designs. This could substantially lower costs and decrease failure rates.
A recent update of iNZight introduced a joining and reshaping module, which gives users a tool to join and reshape data sets. Although applying these operations and obtaining a result is straightforward with a tool like iNZight, there is a lack of tutorials or tools that explain the underlying process in an easy-to-understand manner. This report discusses why these data operations are useful and important, and presents software that teaches key concepts of joining and reshaping data sets through animations.
Read more here.
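For readers unfamiliar with the two operations the module teaches, here is a minimal sketch in pandas (the project itself is built around iNZight and R; the data and column names below are made up for illustration):

```python
import pandas as pd

# Two small, hypothetical tables sharing an "id" key.
students = pd.DataFrame({"id": [1, 2, 3],
                         "name": ["Ana", "Ben", "Cal"]})
scores = pd.DataFrame({"id": [1, 2, 2],
                       "subject": ["math", "math", "stats"],
                       "score": [90, 75, 88]})

# Join: match rows on the shared key; an inner join keeps only
# students who have at least one score (Cal is dropped).
joined = students.merge(scores, on="id", how="inner")

# Reshape: pivot the long table to wide, one column per subject.
wide = joined.pivot_table(index="name", columns="subject", values="score")
print(wide)
```

The animations in the report walk through exactly these steps: how rows are matched during a join, and how values move between rows and columns during a reshape.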
Most current software for survey analysis reads the data into memory; however, many of these computations can actually be expressed as database operations. The aim of this package is to provide a set of functions that allows survey statistics to be computed directly inside a database, built on dplyr and dbplyr. The package goes a step further by also performing graphics computation directly inside a database, reading data tables into memory only when necessary.
Read more here.
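To illustrate the general idea of pushing a survey computation into the database (the package itself does this in R via dplyr/dbplyr, which translate pipelines to SQL), here is a hedged Python/SQLite sketch with made-up data: the weighted mean is computed entirely by the database engine, and only the one-row result is read into memory.

```python
import sqlite3

# In-memory database with a tiny, hypothetical survey table:
# one response variable and a design weight per respondent.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE survey (income REAL, weight REAL)")
conn.executemany("INSERT INTO survey VALUES (?, ?)",
                 [(30000, 1.2), (45000, 0.8), (60000, 1.0)])

# The design-weighted mean expressed as a single SQL aggregate;
# the database does all the arithmetic, not the client.
(weighted_mean,) = conn.execute(
    "SELECT SUM(income * weight) / SUM(weight) FROM survey"
).fetchone()
print(weighted_mean)
```

dplyr/dbplyr automate this translation: the analyst writes an ordinary pipeline, and the backend emits an equivalent SQL query against the database.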