TikTok logo

Data Engineer, Lark - USDS

TikTok

Posted 2 days ago

About this role

About the team
Our team is dedicated to driving product excellence through big data, ensuring that Lark remains the premier communication and productivity engine for a global workforce. You will navigate a high-impact environment where you design high-quality real-time and offline data warehouses, curate efficient data assets, and implement technical solutions that optimize internal workflows. Your work provides the backbone for daily operations and data-driven decision-making, directly enhancing the productivity of hundreds of thousands of employees worldwide.
As a Data Engineer on the US Tech and Product - Lark team, you will have the unique opportunity to build, optimize, and scale the comprehensive data platform powering powering an all-in-one collaborative workspace that redefines global enterprise productivity. Lark cohesively integrates high-performance Chat, Docs, Meetings, and cutting-edge Lark AI into a single, cohesive environment.

Responsibilities:
- Design and implement high-concurrency, real-time streaming pipelines (using Flink, Spark Streaming, or Kafka) that serve as the neural network for the Lark ecosystem.
- Lead the design and maintenance of high-quality offline and real-time data warehouses, ensuring robust ETL/ELT processes that consolidate siloed data from across the Lark suite into a unified source of truth.
- Implement automated data validation frameworks and monitoring tools to ensure the accuracy, consistency, and lineage of critical business metrics that drive executive decision-making.
- Research and deploy industry-leading distributed computing frameworks (e.g., Flink, Spark, Presto) to provide a high-performance foundation for massive-scale internal telemetry and cross-functional data assets.
- Analyze complex user requirements to architect and develop high-performance backend software solutions using Java or Go; apply advanced Data Structures, Algorithms, and Microservices principles to optimize workplace productivity and cross-platform data flow.
- Troubleshoot complex production failures and conduct deep-dive bottleneck analysis for distributed engines; design self-healing mechanisms and performance tuning strategies to ensure the 24/7 stability of Lark’s global data infrastructure.
- Establish proactive alerting and monitoring to ensure pipeline stability and the security of internal employee data.

Job details

Workplace

Office

Location

San Jose, California, United States

Job type

Full Time

Similar

Company

Jobr Assistant extension

Get the extension →