Resume
Tuan Nguyen Link to heading
Discoverer | Handicrafter | Data Engineer
Address: TanPhu, Ho Chi Minh
Mobile/Chat: (+84)904035003
Print this page as PDF
Introduction Link to heading
He has an extraordinary passion for discovering new things
He addicted to programming, cooking, and doing handicrafts
Currently Link to heading
As a Data Engineer with more than 5 years in IT industry experience. His primary responsibilities are helping build ETL pipelines, including modeling, collecting, processing, and provisioning data
Apart from his daily activities, He also help other operational teams with making automation tools to enhance the business working performance
Skills Link to heading
Data Engineering
- Extract data from various multiple data sources like: Remote Object Storage (Amazon S3), SFTP, relational databases, non-relational databases, Facebook Analytics, Google Analytics, social media contents, APIs, etc…
- Process many types of data: transactional data, high volume data, user events, logs
- Perform data transformation, masterizing data, data cleaning, data validation
- Build data pipeline for Batch and Stream processing
- Schedule data pipeline jobs orchestration using GoodData, Apache Airflow
- Design data model for data digestion
Data Analytics
- Design data model for analytics using Star schema, Snowflake schema
- Apply analytics queries to calculate business metrics
- Create interactive reports and dashboards
Data Operations
- Set up development, staging and production environment for data flows
- Implement and integrate client single sign-on authentication and authorization, data mart provisioning
- Config network rules, identity management on cloud
- Monitor and mantain jobs
- Apply automation trigger workflows
Project Management
- Follow Agile and SCRUM software development process
- Use versioning control to store and maintain source
- Prepare technical and non-technical documentation, training material for knowledge transfering to client
- Manage projects and tasks on JIRA. Track working progress and performance on Workfront
Technical
- Programming languages: Javascript, Ruby, Python
- Database: TiDB, Vertica, PostgreSQL, BigQuery
- Containerization: Docker
- Bash: Shell script, CLI commands
- Version control: Github
- ETL Tools: Apache Airflow, Talend, GoodData, Apache Nifi
- Streaming data: HDFS, Hadoop, Apache Spark, Apache Kafka
- Cloud services: AWS (S3, EC2, IAM, VPC, Lambda), Azure (Active Directory, Function App, ServiceBus)
- BI Tools: GoodData, PowerBI, Looker Studio
Others
- Scheduling and managing tasks for the highest productivity
- Design technical courses for training team members or end-users (clients)
- Self-learning, researching technical problems and solutions
- Work well in a multicultural office environment
Experience Link to heading
2020 - Present
KMS Solutions | Sr. Data Engineer
Project: GoodData’s client
Design and build data solutions for GoodData’s clients in various industries such as: Education, financial services, customer satisfication, retail, multimedia
- Design logical data model based on business intelligence analysis requirements using star schema
- Design and build ETL pipeline for centralizing/mastering data in the data warehouse using GoodDataSDK, GoodData Cloud Native
- Implement multiple types of connectors/downloaders/extractors to connect and collect data from various data sources using Ruby, Python
- Build up automation tools to configure processes and perform API calls for data management using shell script, programing languages
- Config processes integrations with client’s system on multiple cloud like SSO SAML, networking connections, data security in AWS, Azure, Google Cloud Platform
- Consulting/training/mentoring data solutions for team members/clients with self-service operations
- Support client’s business critical operations on demands via JIRA tickets
- Build reports, dashboards using Analytical Queries, MAQL
Project: F&B SAAS Client
Support client on technical operations with Product Analytics using Mixplanel
- Maintain and optimize data pipeline of user activities event data tracked from product/application to Mixpanel and Google BigQuery
- Support building product analysis reports using Insights, Funnels, Retentions and Flows reporting
Project: Financial & Banking Client
Build data solution POC for a Financial client base on financial data. Target to have risk analysis, debt collection performance analysis
- Build ETL pipeline to extract data from the client’s Oracle database, centralize data in Vertica that hosted on Amazon EC2 Instance. Used Talend Studio for ETL jobs
- Schedule cron and monitor jobs run in Linux Management Console server hosted on AWS EC2 using Shell scripts and Apache Airflow
- Utilize SQL for data transformation and leverage In-database Machine Learning Functions to train models and make Risk predictions
- Establish a connection to the database system and create dynamic reports and interactive dashboards for analyzing debt collection performance and gaining insights into customer background criteria
Project: Banking Business/Services Client
Build automated flows to digest data from multiple applications data/vendors sources to data warehouse, capture data changes, then return model for reporting and analytical purposes
- Build ETL pipeline to digest data from banking services and application using Apache Nifi
- Optimize flows for handling large size data files
- Set up integration with SMS/Email providers to handle notification requests on data changes
Project: Insurance Service Client
Build an event-driven Python based Parametric Engine that can use weather and satellite data to evaluate insurance claim check.
- Use Azure Functions and Microservices for application deployment
- Design and maintain application structure with best practice API design (OpenAPI)
- Enable application to align with 12 factors methodlogy in microservices
- Prepare unit tests, performance test
- Utilize Azure Service Bus for message-queue communication between functional steps
- Data store and retrieve using Azure Blob Storage
- Integrate with current microservices system, CI/CD flows (Sonar cloud, dependabots,…)
2019 - 2020
Freelance | Full Stack Developer
- Design, implement and maintain the Football Booking System based on the customer’s requirements
- Technical stack:
- Ubuntu cloud, docker
- Java spring boot
- JWT Authentication SSO
- RestfulAPI
- MySQL, Redis
- Website dashboard: ReactJS - Mobile: React Native
- Team size: 3 members
2015 - 2017
English Tutor
- Take responsibility for teaching English in a class of 5 - 10 high school students
Education Link to heading
2012 - 2015
University of Finance and Marketing
- BEc, Finance of Insurance and Investment
2016 - 2021
FPT University
- B.Eng., Software Engineering
Certifications Link to heading
IBM Data Engineering Professional Certificates
AWS Fundamentals SpecializationAWS Fundamentals Specialization
Modern Application Development with Java on AWS
Mixpanel Partner Certification
Project Management Principles and PracticesProject Management Principles and Practices
Awards Link to heading
2016
Top 2 | Final Round | Softskills Traning Program, Level Up - Sponsored by Samsung
2021
Professional Growth Award, recognized by Data and Analytics Department, KMS Solutions
2022
Certificate Of Appreciation, Techcon Organizer, KMS Group
Publications Link to heading
Blog articles Link to heading
2022
Bring Your Product to a Higher Level with Product Analytics
- A blog article to introduce product analytics and how it can help accelerate the business growth
- Published on Dec 1st, 2022
Webinars Link to heading
2022
Webinar | Optimize Customer Journey with Product Analytics
- An English-based webinar to share how to discover why Product Analytics is essential to your business and how to take advantage of it to improve the customer journey
- Organized on Dec 6th, 2022