Software Engineer
Summary
The team focuses on, improving read/write throughput for heterogeneous data sources/sinks, reducing latencies and bounding upper percentiles when communicating with data stores, adding new query optimization rules, choosing better default memory configurations for applications, JVMs, and OS
Added support for the benchmark tool to benchmark ad-hoc queries irrespective of the benchmark type
Modified our tool to enable submit spark job in a single session (single EMR Step) on EMR reducing the total execution time from 2hrs to 30 mins
Added various spark configs (executor and driver) to better handle OOM, JVM crashes and worked on better logging to make debugging easier when an error is encountered
Worked on benchmarking query outliers and its improvements due to various reasons like S3 throttling
Designed and implemented a CI/CD pipeline using AWS CDK on Faragte, enabling automatic execution of integration tests upon code commit
Worked on merging all new Open-source-spark (OSS) commits to our fork of Spark to get latest bug fixes, improvements etc
Expectations
Software engineering job to learn, grow and challenge myself
Employment Preferences
Expected Base Salary
**5,000 USD
Academic Degree
Experience
Total Professional Experience
Startup Experience
Big-Tech Companies
Enterprise Experience
Contacts are hidden
Send a connection request to the candidate to get their contact details.
Contact Candidate
