Data Engineer
Amazon
Description
We're seeking an exceptional Data Engineer to join our Risk and Compliance Solutions (RCS) Data Engineering team, where you'll architect and build data systems that power Amazon's compliance operations. This role combines technical expertise with business acumen to transform complex data into actionable insights that protect Amazon's ecosystem of buyers, brands, and sellers.
Your work will directly influence Amazon's compliance framework and risk management capabilities, ensuring the company's continued growth while maintaining regulatory compliance. You'll be instrumental in building the next generation of data-driven compliance tools that protect Amazon's global marketplace. Your experience with real-time data processing, high-throughput systems, and end-to-end platform development. Knowledge of modern data engineering tools and technologies is essential.
The ideal candidate combines technical excellence with strategic thinking, bringing both the ability to architect complex systems and the vision to drive innovation in compliance technology.
Core Responsibilities:
Contribute to the architecture, design and implementation of next generation BI solutions – including streaming data applications.
Manage AWS resources including EC2, RDS, Redshift, Kinesis, EMR, Lambda etc.
Collaborate with data scientists, BIEs and BAs to deliver high quality data architecture and pipelines.
Interface with other technology teams to extract, transform, and load data from a wide variety of data sources
Continually improve ongoing reporting and analysis processes, automating or simplifying self-service support for customers
Basic Qualifications:
Bachelor's degree in computer science, engineering, mathematics, or a related technical discipline
Industry experience in software development, data engineering, business intelligence, data science, or related field with a track record of manipulating, processing, and extracting value from large datasets
Experience using big data technologies (Hadoop, Hive, Hbase, Spark, EMR, etc.)
Experience working with AWS big data technologies (EMR, Redshift, S3, AWS Glue, Kinesis and Lambda for Serverless ETL)
Knowledge of data management fundamentals and data storage principles
Knowledge of distributed systems as it pertains to data storage and computing
Hands-on experience and advanced knowledge of SQL
Basic scripting skills using Python and Scala
Basic understanding of Machine Learning
Key job responsibilities
• Design and implement scalable data infrastructure supporting RCS's compliance and risk management initiatives
• Develop robust data pipelines and analytics processes that enable real-time decision making
• Collaborate with compliance officers, software engineers, and product managers to deliver reliable data solutions
• Lead technical initiatives and mentor team members in best practices for data engineering
• Create automated systems to replace manual processes and support Amazon's global expansion