Notifications

Are you open to new opportunities?

Upload your resume to increase your chances of getting noticed.

Bitus Labs
2.5
Senior Data Engineer (Chinese Mandarin Speaker)
Irvine, CA
$130K (Employer provided)
Easy Apply
This is a hands-on engineering role — you will write production code in Java and Python every day, while also contributing to platform design decisions,……
24h
Optimus Health Analytics
Data Engineer
United States
$80K - $110K (Employer provided)
Easy Apply
Experience supporting analytics engineering, dbt, machine learning pipelines, or feature engineering workflows. Strong proficiency in Python and SQL.…
24h
City Mattress
3.5
BI Developer/ Data Engineer
Bonita Springs, FL
Easy Apply
You'll work directly with our Sr. Manager of BI & Data Analytics to develop and maintain Power BI reports, support our Microsoft Fabric environment, and help……
19d
trssrecruiting@trssllc.com
Data Engineer
Burlington, VT
$115K - $138K (Employer provided)
Easy Apply
In this dynamic role, you will design, develop, and maintain scalable data pipelines and architectures that empower our organization to harness the full……
30d+
MetroPlusHealth
3.1
Data Engineer (SQL, Programming, Healthcare)
New York, NY
$121K - $131K (Employer provided)
Easy Apply
Bachelor’s degree in Information Systems, Computer Science, Database Management, or related field. The position of Data Engineer in Analytics and Reporting……
30d+
Younger Mfg
3.3
Principal Integration & Data Platform Engineer
Torrance, CA
$125K - $140K (Employer provided)
Easy Apply
This role provides senior technical leadership and works closely with enterprise applications, manufacturing engineering, and analytics stakeholders to……
26d
Inland Empire Health Plan
3.3
Data Engineer III
California
$119K - $157K (Employer provided)
Maintain process design artifacts like data flow diagrams, end user process maps and technical design documents. 457(b) option with a contribution match.…
30d+
JANSON Communications
3.3
Mid-Level Data Engineer (Kaiserslautern, Germany)
Fairfax, VA
$56K - $88K (Glassdoor est.)
Easy Apply
Excellent communication skills with the ability to explain complex technical concepts to non-technical stakeholders. Data engineering: 3 years (Required).…
30d+
Peraton
3.5
Cyber Operational Risk Analyst / Data Engineer
Fort Meade, MD
$135K - $216K (Employer provided)
Conduct quantitative analysis to the military decision-making processes by developing and applying probability models, statistical inference, simulations,……
30d+
Emergent Staffing
5.0
Senior Data Engineer (Azure & Databricks)
Bloomington, MN
$60.00 - $85.00 Per Hour (Employer provided)
Easy Apply
Contribute to documentation, architectural design, and continuous improvement of data engineering best practices.…
11d
Exelon
3.8
Sr Data Engineer 1 – Enterprise IT Data Analytics
Oakbrook Terrace, IL
$105K - $144K (Employer provided)
Proactively build technical knowledge and business acumen within own discipline or function. Bachelor's degree in computer science or related discipline and 4-7……
5d
MetroPlusHealth
3.1
Risk Adjustment Data Engineer (SQL, Health Plan)
New York, NY
$121K - $131K (Employer provided)
Easy Apply
Represent Risk Adjustment team in technical discussions with key stakeholders; translate business requirements into technical specifications.…
11d
S&N Communications
2.9
Cloud Data Engineer
Louisa, VA
$95K (Employer provided)
Easy Apply
Able to operate independently, make sound technical decisions, and own critical deliverables with limited oversight.…
24h
Peterson Holding Company
3.7
Data & Analytics Engineer
San Leandro, CA
$105K - $125K (Employer provided)
Easy Apply
Bachelor's Degree from a fully accredited college in Data Science, Information Technology, Engineering, or other closely related field; and a minimum of three (……
5d
Estis Compression
Quality Engineer/Data Analyst
Midland, TX
$63K - $94K (Glassdoor est.)
Easy Apply
Bachelors degree in Engineering or related field and a minimum of five years of Quality experience in a production/manufacturing environment is preferred.…
30d+
Summit Technologies & Solutions
4.0
Wind Tunnel Data System Engineer
Hampton, VA
$61K - $95K (Glassdoor est.)
Easy Apply
A valid driver’s license is required. Familiarity with engineering software tools such as LabVIEW and MATLAB is preferred.…
24h
Genmab
4.0
Senior Data Product Engineer, Commercial
Princeton, NJ
$132K - $197K (Employer provided)
The ideal candidate is conceptually strong in modern data architectures and principles, capable of adapting across tools rather than being limited by them and……
30d+
DCI Solutions
3.9
Data Pipeline Reliability Engineer
Washington, DC
$160K - $200K (Employer provided)
Easy Apply
Diagnose, resolve, and prevent issues encountered in the field. Triaging, troubleshooting, and coordinating the resolution of technical issues.…
19d
Globe Life
2.7
Data & AI Engineer III (Hybrid)
McKinney, TX
$83K - $120K (Glassdoor est.)
Easy Apply
Leveraging expertise in AWS services, advanced SQL, Python, Informatica, and Big Data technologies such as Hadoop and Spark, this role also serves as a……
20d
Central Point Partners
4.6
Senior Data Engineer (Microsoft Fabric & Power BI)
United States
$115K - $130K (Employer provided)
Easy Apply
Experience supporting construction, MEP, field service, or project-based operational environments. Support future-state analytics initiatives including……
20d
Steel Warehouse
3.2
Data Engineer
South Bend, IN
$71K - $109K (Glassdoor est.)
Easy Apply
In this dynamic role, you will design, develop, and optimize scalable data pipelines and architectures that empower organizations to harness the full potential……
30d+
Brooksource
3.9
Senior Data Engineer
Tampa, FL
$70.00 - $75.00 Per Hour (Employer provided)
Easy Apply
Serve as a technical lead on cross-functional data projects, guiding design decisions and ensuring alignment with business goals.…
7d
Sparks Group
3.9
Junior Data Engineer - Palantir
McLean, VA
$88K - $129K (Glassdoor est.)
Easy Apply
Bachelor's degree in computer science, data science, mathematics, or related technical field. You’ll use your experience in analytical exploration and data……
16d
General Dynamics Information Technology
3.9
Senior Data Engineer
Arlington, VA
$135K - $183K (Employer provided)
Best Places to Work
Your work will make it faster and easier for teams across engineering, analytics, and product groups to develop, deploy, and improve data-driven capabilities by……
24h
Always Compassionate Health
1.8
Senior Data Platform Engineer, Enterprise Data & Analytics
Melville, NY
$155K (Employer provided)
Easy Apply
Bachelor’s degree in computer science, engineering, information systems, or a related technical field. 3–7 years of experience in data engineering, data……
21d
Amazon
3.5
Systems Engineer, Controls Fleet, Data Center Capacity Delivery
Seattle, WA
$105K - $160K (Employer provided)
Problem Solving: Analyze complex technical issues and develop effective solutions through root cause analysis.…
30d+
Philip Morris International
3.9
Data Engineer - MDM
Tampa, FL
$88K - $110K (Employer provided)
Collaborate with Data Governance and business intelligence, driving engineering and technical implementation of the master data solution & services, ensuring……
11d
LVI Associates
3.7
Industrial Data Engineer
Bend, OR
$70K - $107K (Glassdoor est.)
Easy Apply
This role focuses on building reliable data pipelines, models, and reporting tools that turn plant and business data into actionable insights.…
22d
Professional Staffing Services
4.3
Data Engineer
Orlando, FL
$90K - $100K (Employer provided)
Easy Apply
*Cross-Functional Collaboration:* Partner closely with project managers, business analysts, data scientists, and key stakeholders to translate functional……
6d
Acra Lending
3.0
Data Engineer
Irvine, CA
$135K - $145K (Employer provided)
Easy Apply
This is a key position within the company and aligns with our data strategy, driving optimization, reliability, and robust business intelligence capabilities……
30d+

Bitus Labs

2.5

Senior Data Engineer (Chinese Mandarin Speaker)

Irvine, CA

$130K (Employer provided)

Is your resume a good match?

Use AI to find out how well the skills on your resume fit this job description.

About the Role
We are looking for a Senior Data Engineer to join our Data Platform team and take ownership of building and scaling our AWS-based data lakehouse. You will architect and deliver robust, production-grade data pipelines, work closely with data scientists, analytics engineers, and product teams, and set the technical direction for how data flows across the organization. This is a hands-on engineering role — you will write production code in Java and Python every day, while also contributing to platform design decisions, mentoring junior engineers, and driving best practices around data quality, reliability, and governance.

Responsibilities

Data Lakehouse Architecture & Development

Design and build scalable medallion-architecture data lakehouses (Bronze / Silver / Gold) on AWS S3 using Apache Iceberg table format.
Develop and maintain high-throughput ETL/ELT pipelines using AWS Glue, EMR (Spark), and Lambda.
Implement schema evolution, partitioning strategies, and compaction processes for Iceberg tables to optimize storage and query performance.
Write production-quality pipeline code in both Java and Python, selecting the appropriate language per performance and maintainability requirements.

Real-Time & Batch Streaming

Build and operate event-driven data pipelines using Amazon Kinesis Data Streams, Kinesis Firehose, or Apache Kafka (MSK).
Design exactly-once and at-least-once processing semantics for streaming workloads using Apache Flink or Spark Structured Streaming on EMR.

AWS Platform Engineering

Manage infrastructure as code using AWS CDK or Terraform for repeatable, auditable data platform deployments.
Optimize cost and performance across AWS services including S3, Glue, Athena, Redshift Spectrum, EMR, Lambda, Step Functions, and EventBridge.
Implement data security best practices: IAM least-privilege policies, KMS encryption, VPC networking, and Lake Formation fine-grained access control.
Build and maintain CI/CD pipelines for data workloads using AWS CodePipeline, GitHub Actions, or equivalent.

Data Quality & Governance

Implement data quality frameworks (e.g., Great Expectations, Deequ) and integrate validation steps into pipeline orchestration.
Define and enforce data contracts between producing and consuming systems.
Contribute to data cataloguing and lineage tracking using AWS Glue Data Catalog or Apache Atlas.

Collaboration & Technical Leadership

Partner with data scientists, ML engineers, and analysts to understand data requirements and deliver performant, well-documented datasets.
Mentor mid-level and junior engineers through code reviews, design discussions, and pair programming.
Document architecture decisions (ADRs) and contribute to internal engineering knowledge base.

Required Qualifications

Experience

5+ years of professional data engineering experience, with at least 3 years on AWS cloud platforms.
Proven track record of delivering production data pipelines at scale (TB+ datasets, highthroughput SLAs).
Experience with data lakehouse architectures — medallion pattern, open table formats (Iceberg preferred; Delta Lake or Hudi acceptable).

Programming Languages

Java: Strong command of Java (8+) for Spark jobs, custom Iceberg connectors, and performance-critical pipeline components. Familiarity with Maven/Gradle build systems.
Python: Proficient in Python 3 for AWS Glue scripts, orchestration logic, data quality checks, and automation tooling. Experience with pandas, PySpark, boto3, and packaging best practices.

AWS Core Services

Storage & Compute: S3, Glue (jobs, crawlers, Data Catalog), EMR (Spark/Flink), Lambda, EC2.
Streaming: Kinesis Data Streams, Kinesis Firehose, or MSK (Managed Kafka).
Orchestration: Step Functions, MWAA (Managed Airflow), or EventBridge Scheduler.
Querying: Athena, Redshift, or Redshift Spectrum.
Security & Governance: IAM, KMS, Lake Formation, Secrets Manager, VPC.
DevOps: AWS CDK or CloudFormation; CodePipeline or equivalent CI/CD tools.

Data Processing Frameworks

Apache Spark (PySpark and/or Spark Java API) — distributed transformations, performance tuning, memory management.
Apache Iceberg — table maintenance, time travel, snapshot management, partition evolution.
SQL — advanced SQL for data transformation, window functions, CTEs, query optimization.

Preferred / Nice to Have

AWS Certified Data Engineer – Associate or AWS Certified Solutions Architect certification.
Experience with dbt for SQL-based transformation layers on top of the lakehouse.
Familiarity with ML platform integration: feature stores (SageMaker Feature Store), model serving data needs, or MLflow experiment tracking.
Experience with real-time OLAP engines such as Apache Druid or ClickHouse.
Contributions to open-source data tooling or internal platform libraries.
Exposure to data mesh or data product thinking — defining domain ownership and data contracts.

Tech Stack at a Glance

Languages

Java (8+), Python 3

Cloud Platform

AWS (S3, Glue, EMR, Kinesis, Athena, Lambda, Step Functions, Lake Formation, CDK)

Processing

Apache Spark, Apache Flink, Spark Structured Streaming

Table Format

Apache Iceberg (primary), Delta Lake / Hudi (familiarity)

Streaming

Amazon Kinesis, MSK (Kafka), Kinesis Firehose

Orchestration

Apache Airflow (MWAA), AWS Step Functions

IaC & CI/CD

AWS CDK / Terraform, GitHub Actions / CodePipeline

Pay: From $130,000.00 per year

Benefits:

401(k)
401(k) matching
Dental insurance
Health insurance
Life insurance
Paid time off
Parental leave
Retirement plan
Vision insurance

Language:

Chinese (Required)

Ability to Commute:

Irvine, CA 92618 (Required)

Work Location: In person

See company reviews

Base pay

The minimum salary is $130K and the max salary is $130K.

$130K/yr (Employer provided)

Irvine, CA

If an employer includes a salary or salary range on their job, we display it as "Employer Provided". If a job has no salary data, Glassdoor displays a "Glassdoor Estimate" if available. To learn more about "Glassdoor Estimates," see our FAQ page.

Working here doesn’t have to be a secret

2.5

Recommend to a friend
Approve of CEO
CEO: 0 Ratings

Bitus Labs

2.5

Senior Data Engineer (Chinese Mandarin Speaker)

Irvine, CA

$130K (Employer provided)

Bitus Labs

2.5

Senior Data Engineer (Chinese Mandarin Speaker)

Irvine, CA

$130K (Employer provided)

Is your resume a good match?

Use AI to find out how well the skills on your resume fit this job description.

Responsibilities

Data Lakehouse Architecture & Development

Design and build scalable medallion-architecture data lakehouses (Bronze / Silver / Gold) on AWS S3 using Apache Iceberg table format.
Develop and maintain high-throughput ETL/ELT pipelines using AWS Glue, EMR (Spark), and Lambda.
Implement schema evolution, partitioning strategies, and compaction processes for Iceberg tables to optimize storage and query performance.
Write production-quality pipeline code in both Java and Python, selecting the appropriate language per performance and maintainability requirements.

Real-Time & Batch Streaming

Build and operate event-driven data pipelines using Amazon Kinesis Data Streams, Kinesis Firehose, or Apache Kafka (MSK).
Design exactly-once and at-least-once processing semantics for streaming workloads using Apache Flink or Spark Structured Streaming on EMR.

AWS Platform Engineering

Manage infrastructure as code using AWS CDK or Terraform for repeatable, auditable data platform deployments.
Optimize cost and performance across AWS services including S3, Glue, Athena, Redshift Spectrum, EMR, Lambda, Step Functions, and EventBridge.
Implement data security best practices: IAM least-privilege policies, KMS encryption, VPC networking, and Lake Formation fine-grained access control.
Build and maintain CI/CD pipelines for data workloads using AWS CodePipeline, GitHub Actions, or equivalent.

Data Quality & Governance

Implement data quality frameworks (e.g., Great Expectations, Deequ) and integrate validation steps into pipeline orchestration.
Define and enforce data contracts between producing and consuming systems.
Contribute to data cataloguing and lineage tracking using AWS Glue Data Catalog or Apache Atlas.

Collaboration & Technical Leadership

Partner with data scientists, ML engineers, and analysts to understand data requirements and deliver performant, well-documented datasets.
Mentor mid-level and junior engineers through code reviews, design discussions, and pair programming.
Document architecture decisions (ADRs) and contribute to internal engineering knowledge base.

Required Qualifications

Experience

5+ years of professional data engineering experience, with at least 3 years on AWS cloud platforms.
Proven track record of delivering production data pipelines at scale (TB+ datasets, highthroughput SLAs).
Experience with data lakehouse architectures — medallion pattern, open table formats (Iceberg preferred; Delta Lake or Hudi acceptable).

Programming Languages

Java: Strong command of Java (8+) for Spark jobs, custom Iceberg connectors, and performance-critical pipeline components. Familiarity with Maven/Gradle build systems.
Python: Proficient in Python 3 for AWS Glue scripts, orchestration logic, data quality checks, and automation tooling. Experience with pandas, PySpark, boto3, and packaging best practices.

AWS Core Services

Storage & Compute: S3, Glue (jobs, crawlers, Data Catalog), EMR (Spark/Flink), Lambda, EC2.
Streaming: Kinesis Data Streams, Kinesis Firehose, or MSK (Managed Kafka).
Orchestration: Step Functions, MWAA (Managed Airflow), or EventBridge Scheduler.
Querying: Athena, Redshift, or Redshift Spectrum.
Security & Governance: IAM, KMS, Lake Formation, Secrets Manager, VPC.
DevOps: AWS CDK or CloudFormation; CodePipeline or equivalent CI/CD tools.

Data Processing Frameworks

Apache Spark (PySpark and/or Spark Java API) — distributed transformations, performance tuning, memory management.
Apache Iceberg — table maintenance, time travel, snapshot management, partition evolution.
SQL — advanced SQL for data transformation, window functions, CTEs, query optimization.

Preferred / Nice to Have

AWS Certified Data Engineer – Associate or AWS Certified Solutions Architect certification.
Experience with dbt for SQL-based transformation layers on top of the lakehouse.
Familiarity with ML platform integration: feature stores (SageMaker Feature Store), model serving data needs, or MLflow experiment tracking.
Experience with real-time OLAP engines such as Apache Druid or ClickHouse.
Contributions to open-source data tooling or internal platform libraries.
Exposure to data mesh or data product thinking — defining domain ownership and data contracts.

Tech Stack at a Glance

Languages

Java (8+), Python 3

Cloud Platform

AWS (S3, Glue, EMR, Kinesis, Athena, Lambda, Step Functions, Lake Formation, CDK)

Processing

Apache Spark, Apache Flink, Spark Structured Streaming

Table Format

Apache Iceberg (primary), Delta Lake / Hudi (familiarity)

Streaming

Amazon Kinesis, MSK (Kafka), Kinesis Firehose

Orchestration

Apache Airflow (MWAA), AWS Step Functions

IaC & CI/CD

AWS CDK / Terraform, GitHub Actions / CodePipeline

Pay: From $130,000.00 per year

Benefits:

401(k)
401(k) matching
Dental insurance
Health insurance
Life insurance
Paid time off
Parental leave
Retirement plan
Vision insurance

Language:

Chinese (Required)

Ability to Commute:

Irvine, CA 92618 (Required)

Work Location: In person

See company reviews

Base pay

The minimum salary is $130K and the max salary is $130K.

$130K/yr (Employer provided)

Irvine, CA

Working here doesn’t have to be a secret

2.5

Recommend to a friend
Approve of CEO
CEO: 0 Ratings

Are you open to new opportunities?

4,556 Data engineer analytics jobs in United States

Base pay

Working here doesn’t have to be a secret

Base pay

Working here doesn’t have to be a secret