Resume

Tuan Nguyen Link to heading

Discoverer | Handicrafter | Data Engineer
Address: TanPhu, Ho Chi Minh
Mobile/Chat: (+84)904035003
Print this page as PDF

Introduction Link to heading

He has an extraordinary passion for discovering new things
He addicted to programming, cooking, and doing handicrafts

Currently Link to heading

As a Data Engineer with more than 5 years in IT industry experience. His primary responsibilities are helping build ETL pipelines, including modeling, collecting, processing, and provisioning data
Apart from his daily activities, He also help other operational teams with making automation tools to enhance the business working performance

Skills Link to heading

Data Engineering

  • Extract data from various multiple data sources like: Remote Object Storage (Amazon S3), SFTP, relational databases, non-relational databases, Facebook Analytics, Google Analytics, social media contents, APIs, etc…
  • Process many types of data: transactional data, high volume data, user events, logs
  • Perform data transformation, masterizing data, data cleaning, data validation
  • Build data pipeline for Batch and Stream processing
  • Schedule data pipeline jobs orchestration using GoodData, Apache Airflow
  • Design data model for data digestion

Data Analytics

  • Design data model for analytics using Star schema, Snowflake schema
  • Apply analytics queries to calculate business metrics
  • Create interactive reports and dashboards

Data Operations

  • Set up development, staging and production environment for data flows
  • Implement and integrate client single sign-on authentication and authorization, data mart provisioning
  • Config network rules, identity management on cloud
  • Monitor and mantain jobs
  • Apply automation trigger workflows

Project Management

  • Follow Agile and SCRUM software development process
  • Use versioning control to store and maintain source
  • Prepare technical and non-technical documentation, training material for knowledge transfering to client
  • Manage projects and tasks on JIRA. Track working progress and performance on Workfront

Technical

  • Programming languages: Javascript, Ruby, Python
  • Database: TiDB, Vertica, PostgreSQL, BigQuery
  • Containerization: Docker
  • Bash: Shell script, CLI commands
  • Version control: Github
  • ETL Tools: Apache Airflow, Talend, GoodData, Apache Nifi
  • Streaming data: HDFS, Hadoop, Apache Spark, Apache Kafka
  • Cloud services: AWS (S3, EC2, IAM, VPC, Lambda), Azure (Active Directory, Function App, ServiceBus)
  • BI Tools: GoodData, PowerBI, Looker Studio

Others

  • Scheduling and managing tasks for the highest productivity
  • Design technical courses for training team members or end-users (clients)
  • Self-learning, researching technical problems and solutions
  • Work well in a multicultural office environment

Experience Link to heading

2020 - Present KMS Solutions | Sr. Data Engineer

Project: GoodData’s client
Design and build data solutions for GoodData’s clients in various industries such as: Education, financial services, customer satisfication, retail, multimedia

  • Design logical data model based on business intelligence analysis requirements using star schema
  • Design and build ETL pipeline for centralizing/mastering data in the data warehouse using GoodDataSDK, GoodData Cloud Native
  • Implement multiple types of connectors/downloaders/extractors to connect and collect data from various data sources using Ruby, Python
  • Build up automation tools to configure processes and perform API calls for data management using shell script, programing languages
  • Config processes integrations with client’s system on multiple cloud like SSO SAML, networking connections, data security in AWS, Azure, Google Cloud Platform
  • Consulting/training/mentoring data solutions for team members/clients with self-service operations
  • Support client’s business critical operations on demands via JIRA tickets
  • Build reports, dashboards using Analytical Queries, MAQL

Project: F&B SAAS Client
Support client on technical operations with Product Analytics using Mixplanel

  • Maintain and optimize data pipeline of user activities event data tracked from product/application to Mixpanel and Google BigQuery
  • Support building product analysis reports using Insights, Funnels, Retentions and Flows reporting

Project: Financial & Banking Client
Build data solution POC for a Financial client base on financial data. Target to have risk analysis, debt collection performance analysis

  • Build ETL pipeline to extract data from the client’s Oracle database, centralize data in Vertica that hosted on Amazon EC2 Instance. Used Talend Studio for ETL jobs
  • Schedule cron and monitor jobs run in Linux Management Console server hosted on AWS EC2 using Shell scripts and Apache Airflow
  • Utilize SQL for data transformation and leverage In-database Machine Learning Functions to train models and make Risk predictions
  • Establish a connection to the database system and create dynamic reports and interactive dashboards for analyzing debt collection performance and gaining insights into customer background criteria

Project: Banking Business/Services Client
Build automated flows to digest data from multiple applications data/vendors sources to data warehouse, capture data changes, then return model for reporting and analytical purposes

  • Build ETL pipeline to digest data from banking services and application using Apache Nifi
  • Optimize flows for handling large size data files
  • Set up integration with SMS/Email providers to handle notification requests on data changes

Project: Insurance Service Client
Build an event-driven Python based Parametric Engine that can use weather and satellite data to evaluate insurance claim check.

  • Use Azure Functions and Microservices for application deployment
  • Design and maintain application structure with best practice API design (OpenAPI)
  • Enable application to align with 12 factors methodlogy in microservices
  • Prepare unit tests, performance test
  • Utilize Azure Service Bus for message-queue communication between functional steps
  • Data store and retrieve using Azure Blob Storage
  • Integrate with current microservices system, CI/CD flows (Sonar cloud, dependabots,…)

2019 - 2020 Freelance | Full Stack Developer

  • Design, implement and maintain the Football Booking System based on the customer’s requirements
  • Technical stack:
    • Ubuntu cloud, docker
    • Java spring boot
    • JWT Authentication SSO
    • RestfulAPI
    • MySQL, Redis
    • Website dashboard: ReactJS - Mobile: React Native
  • Team size: 3 members

2015 - 2017 English Tutor

  • Take responsibility for teaching English in a class of 5 - 10 high school students

Education Link to heading

2012 - 2015 University of Finance and Marketing

  • BEc, Finance of Insurance and Investment

2016 - 2021 FPT University

  • B.Eng., Software Engineering

Certifications Link to heading

IBM Data Engineering Professional Certificates

AWS Fundamentals SpecializationAWS Fundamentals Specialization

Modern Application Development with Java on AWS

Mixpanel Partner Certification

Project Management Principles and PracticesProject Management Principles and Practices

Awards Link to heading

2016 Top 2 | Final Round | Softskills Traning Program, Level Up - Sponsored by Samsung

2021 Professional Growth Award, recognized by Data and Analytics Department, KMS Solutions

2022 Certificate Of Appreciation, Techcon Organizer, KMS Group

Publications Link to heading

Blog articles Link to heading

2022 Bring Your Product to a Higher Level with Product Analytics

  • A blog article to introduce product analytics and how it can help accelerate the business growth
  • Published on Dec 1st, 2022

Webinars Link to heading

2022 Webinar | Optimize Customer Journey with Product Analytics

  • An English-based webinar to share how to discover why Product Analytics is essential to your business and how to take advantage of it to improve the customer journey
  • Organized on Dec 6th, 2022