An aspiring engineer with 2+ years of professional experience, focused on leveraging information technology and procurement solutions,
with strong Python, SQL, Tableau, Excel, data warehousing, and data modeling skills.
Portfolio
Analytics & Visualization
Tableau dashboard - IMDB
Tableau dashboard - ABNB
Advanced Data Mining Series
PCI DSS
Business Strategy Analysis of Jumia
Overview of Data Lake
Cybersecurity Case Analysis
Data Mining & Visualization
Tech Projects
2018 - 2019
2019 - 2020
2018 - 2019
2019
2019
2019 - 2020
Alive APP is my first independently developed Android mobile app, with an AR interaction skeleton based on Unity3D and Visual Studio. It aims to help people stick to good habits by earning rewards.
2019
2019 - 2020
Pet Dog Robot is a corporate project with students from EE and Ecom backgrounds. My duty was to design a Java- and MySQL-based sales management system with JDBC and a Java GUI, covering data collection, processing, modeling, and querying, as well as development and testing.
2019
Online Shopping Website Database Design is a highly normalized (4NF) relational database model built in MySQL Workbench. SQL queries were written to demonstrate that the design is usable in practice.
2019 - 2020
Course Tree App is a solution-oriented, data-driven WeChat mini-program that integrates career, curriculum, online education, and book market resources to guide students in pursuing their future careers.
2020
2019 - 2020
Data Structure Package is a combined set of C implementations of classic data structure problems.
2020
2019 - 2020
Scrapy shares the source code I wrote to collect nationwide IoT device data, using a Scrapy crawler in Python.
2020
Campus Autopilot is a vehicle violation detection robot based on target recognition, computer vision, and SLAM technology. It patrols the campus and judges car parking using stream processing, cloud storage, cloud computing, and optimized algorithms. Owners of incorrectly parked private cars are warned by a notification in a mobile app.
2021
DNS Relay forwards queries between resolvers and DNS servers. The relay can also screen queries by checking IP addresses and domain names against data stored in a database. The implementation is based on Java.
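The screening step can be illustrated with a minimal sketch. It is written in Python for brevity and is not the project's Java code. The `decide` and `normalize` functions, the sample table, and the use of `0.0.0.0` as a "blocked" marker are all assumptions made for illustration.

```python
from typing import Dict, Optional, Tuple

BLOCKED_IP = "0.0.0.0"  # assumption: this sentinel address marks a screened domain


def normalize(domain: str) -> str:
    """Lower-case a domain name and drop any trailing dot."""
    return domain.strip().lower().rstrip(".")


def decide(domain: str, table: Dict[str, str]) -> Tuple[str, Optional[str]]:
    """Return the relay's action for a query: block, answer locally, or forward."""
    ip = table.get(normalize(domain))
    if ip is None:
        return ("forward", None)  # unknown name: pass the query to the upstream server
    if ip == BLOCKED_IP:
        return ("block", None)    # screened name: refuse to resolve it
    return ("local", ip)          # known name: answer from the stored record


# Hypothetical stored data, for illustration only.
table = {"ads.example.com": BLOCKED_IP, "intranet.example.com": "10.0.0.5"}

print(decide("Ads.Example.com.", table))
print(decide("intranet.example.com", table))
print(decide("www.example.org", table))
```

A real relay would also parse and build DNS packets and talk to an upstream server over UDP. Those parts are left out here.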
2021
2019 - 2020
Classification & Clustering Package is a combined set of Python implementations of the DBSCAN, K-Means, Bayes, and Decision Tree algorithms.
2022
Descriptions-Industries Deduction is a runnable pre-API program based on Python. Given a set of business descriptions, the program automatically matches each company with a specific industry. It is based on machine learning models and tools (fastText, KNN, LDA, Gensim), NLP, PCA, etc. The deduction accuracy is about 90%.
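The idea of matching a description to an industry can be sketched with a tiny nearest-neighbour classifier. This is only an illustration using the standard library. It replaces fastText, LDA, and Gensim with plain word counts, and the function names and sample data are hypothetical.

```python
import math
import re
from collections import Counter
from typing import List, Tuple


def vectorize(text: str) -> Counter:
    """Turn a description into a bag-of-words count vector."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two count vectors (0.0 if either is empty)."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def predict_industry(description: str, labeled: List[Tuple[str, str]], k: int = 3) -> str:
    """Vote among the k labeled descriptions most similar to the query."""
    query = vectorize(description)
    ranked = sorted(labeled, key=lambda item: cosine(query, vectorize(item[0])), reverse=True)
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]


# Hypothetical training data, for illustration only.
labeled = [
    ("develops cloud software and mobile apps", "Technology"),
    ("builds enterprise software platforms", "Technology"),
    ("operates restaurants and catering services", "Food Service"),
    ("runs coffee shops and bakery restaurants", "Food Service"),
]

print(predict_industry("software company building mobile apps", labeled))
```

Learned embeddings would stand in for the raw word counts in a fuller version, but the nearest-neighbour vote would work the same way.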
RFID Ticket Check is a project focused on both the hardware configuration of RFID middleware and the communication between readers and the server via tickets with embedded RFID tags.
Bond Credit Risk Analysis is my graduation project, based on Python, machine learning, and statistics. The models aim to predict the likelihood of bond issuers failing to repay investors. Its innovations are: 1) advanced factor selection; 2) optimized ML models.

Eco-scan: A Sustainable Solution
Built an end-to-end architecture for a sustainable solution based on Agile project management:
- Designed epics and user stories, UI/UX, and business processes;
- Constructed the database schema;
- Defined the software, security, and deployment architecture;
- Defined the technology stack, architecture execution plan, and operationalization.
Work
2021.09 - Now

Help manage the student cafe, including inventory, organization, customer service, and service improvement, alongside diverse co-workers and managers.
Working as a Student Assistant at
University of Washington
in Seattle, WA, United States.
2021.03 - 2021.09


I worked as a Data Analyst Intern at the British Council in Beijing, China.

- Achieved 10K+ in savings by evaluating 300+ contracts and 100+ tenders across diverse industries (e.g., education, business, and services), based on data analysis and visualization with advanced Excel and Power BI.
- Reduced the procurement teams' workload by 80% by optimizing the purchasing process for office supplies on the JD B2B platform, considering flow strategy, including process capacity and service efficiency parameters.
- Successfully onboarded a new employee for business case reporting with data analysis, and coordinated schedules for a team of 5 category managers using prioritization, problem-solving, and communication skills.
Education
2019.07 - 2020.09



I worked as a Data Engineer Intern at
Chinese Academy of Sciences for
IoT Devices Book Project & Vulnerability Reporting System
in Beijing, China.
- Developed industry categorical APIs for a $1M+ national project, based on company name and business scope, combining machine learning principles (e.g., Classification Intelligence, Dimension Deduction).
- Prepared data with 95%+ coverage for maintaining a nationwide IoT device database, using a Scrapy crawler in Python to automatically extract detailed device information from 100,000+ websites in China.
- Extracted real-time breach information from a NoSQL database with SQL queries, and parsed it with NLP techniques by constructing a tree over the text's semantic segments and filtering component types.
2020.07 - 2020.09


I tested software and hardware for a new product, a film scanner for dentistry. I worked closely with the engineers, discussing solutions for newly discovered bugs. When I left the position, the product was already on sale. Here are a few highlights:
- Tested the supporting software for i-Scan, a dental film scanner, designed a black-box test plan for the software, and contributed 300+ uncommon bug records.
- Tested the i-Scan hardware for tooth filming and scanning, and identified potential equipment problems through testing.
- Moved the product development schedule forward by 1+ month.