Professional Data Engineer with over five years of software development experience, specializing in database technologies, data wrangling, data pipelines, and analytics.
Builds mission-critical applications with a focus on performance, availability, and scalability.
Python (pandas, NumPy, Dask, PySpark), Java, knowledge of C++, SQL, Hive, HBase, Hadoop (MapR, Cloudera), Spark, Kafka, Redis, ZeroMQ, Flask, Docker
Machine learning (scikit-learn), microservice architecture, data visualization (Tableau), Spring Boot, Presto, public clouds: AWS (S3, EC2, EMR, Lambda, Redshift, DynamoDB) and GCP (BigQuery, Bigtable, Dataprep), Airflow/Dataswarm, Scala
Data structures and algorithms, scaling and distributed computing, cloud computing, concurrency, parallelism, threads and processes, CAP theorem, in-memory caching, serverless, JVM (Java Virtual Machine)
Academic Degree

6 years

4 years

1 year

4 years
