I have a Master's Degree in Computer Engineering from Hanoi University of Technology. I am a data engineer with 9+ years of experience. I am motivated and enthusiastic by new computing challenges, esp. ones related to Big Data, AI & Blockchain.
I am flexible to project demands. I thrive in unfamiliar situations and enjoy opportunities to learn and gain exposure to new ideas and experiences. I am open and willing to learn whatever is necessary to accomplish goals.
Knowledge & Proficiency
Big Data stack
Programming & Proficiency
HTML & CSS
Led a team to build a massive-scale social media monitoring system using open source components including: crawlers (written in Java & Scala), messaging (using ActiveMQ & Protobuf), analytics (using Spark, Python Scikit-learn) and storage (with HBase, Redis, ElasticSearch, MySQL)
Optimized performance for Python microservice using ZeroMQ & Greenlet (Apache Thrift does NOT suit my needs)
Researched on Statistics, NLP (Natural language processing). I worked under the mentorship of several PhDs. They helped me efficiently learn & improve my knowledge on AI.
Improved the in-house Vietnamese tokenizer
Developed algorithms based on Stanford CoreNLP for matching (token level) documents, extracting snippets...
Trained predictive models with ScikitLearn, XGBoost & Gensim for various projects like document categorization, sentiment analysis.
Conducted various researches on Big Data technologies (Hadoop stack), Machine Learning & Text Mining algorithms such as SVM, Neural Network, Ensemble method, TF-IDF … for detecting trends, related topics...
Led a project of building a large-scale crawler based on Nutch for collecting information from web and social networks
Worked directly with international technology partners
Used ANTLR to develop a language ,which is scriptable and based on context free grammar, to generate data for fuzzing.
Involved in many phases to design a product such as UI/UX, High-performance backends... I've got in-depth knowledge of systems and assisted technologies. Additionally, I'm very interested in designing business model for products.
Via forensics and investigation, I helped many people secure their computing systems & identities
Designed core content, prepared syllabus for intern students
Gave lectures on Network programming & Security
Researched extensively on Application Security, Reverse Engineering, Windows kernel system...
Discovered several critical vulnerabilities in popular applications. The very first vulnerability of Google Chrome was submitted by me
Presented my paper entitled "Unicode shellcode and improvements" at Bellua Cyber Security conference (2008). In the paper I demonstrated a novel idea on optimizing size of Unicode shellcodes.
My public things
Here are several things I do in my leisure time
- Reading books about technologies & entrepreneurship
- Self-studying several topics I like such as Machine Learning, Blockchain ...
- Contributing to various open source projects like: MQTT.js, Gensim, HapiJS
- Managing several technology communities I founded such as "Vietnam Data Scientists" (since 2013), "Vietnam Blockchain Experts" (since 2016)
My digital traces in the online world
- ComputerWorld: Google issues first patches for Chrome
- F-Secure: Microchip MPLAB IDE Buffer Overflow Vulnerability
- Article: Building a Search Text Box Control with WPF
- MQTT.js contributor
- Gensim contributor
- Introduction to Zookeeper (slideshare)
- Vietnam Data Scientists community
- Vietnam Blockchain Experts community