
Senior Data Engineer (Data Warehouse) - Web3
Binance
Posted 5/20/2025

Job Location
Job Summary
We are seeking a Senior Data Engineer to design and develop a flexible, scalable data warehouse aligned with company specifications and business requirements. The ideal candidate will have 5+ years of experience designing and developing data lakes and data warehouse solutions, proficiency in Java, Scala or Python, and strong Hive & Spark SQL programming skills. They will collaborate closely with business and technical teams to architect and maintain a highly flexible, enterprise-scale data warehouse that accelerates insights and minimizes redundant work. The Senior Data Engineer will lead data governance initiatives by building and maintaining metadata management and data quality monitoring systems, and will foster technical team growth through mentorship, knowledge sharing, and continuous improvement of collective skills. Experience managing petabyte-scale data in Internet-scale environments and resolving critical production incidents is preferred. The role suits someone who thrives in a results-driven workplace with opportunities for career growth and continuous learning. Binance offers a competitive salary, company benefits, and a work-from-home arrangement.
Job Description
Responsibilities
- Architect and implement a flexible, scalable data warehouse aligned with company specifications and business requirements, accelerating delivery and reducing redundant development.
- Design, develop, test, deploy and monitor data models and ETL jobs; rapidly troubleshoot complex issues and optimize calculation logic and pipeline performance.
- Lead data governance initiatives by building and maintaining metadata management and data quality monitoring systems.
- Foster technical team growth through mentorship, knowledge sharing and continuous improvement of collective skills.
Requirements
- 5+ years of hands-on experience designing and developing data lakes and data warehouse solutions.
- Deep expertise in data warehouse modeling and governance, including dimensional modeling, information factory (data vault) methodologies and “one data” principles.
- Proficiency in at least one of Java, Scala or Python, plus strong Hive & Spark SQL programming skills.
- Practical experience with OLAP engines (e.g., Apache Kylin, Impala, Presto, Druid).
- Proven track record in building high-throughput batch pipelines on Big Data platforms.
- Familiarity with core Big Data technologies (Hadoop, Hive, Spark, Delta Lake, Hudi, Presto, HBase, Kafka, Zookeeper, Airflow, Elasticsearch, Redis).
- AWS Big Data service experience is a plus.
- Strong analytical and system-design capabilities, with a clear understanding of business requirements and ability to abstract and architect solutions.
- Collaborative mindset, skilled at building partnerships across teams and stakeholders.
- Preferred: Experience managing petabyte-scale data in Internet-scale environments and resolving critical production incidents.
- Bilingual English/Mandarin proficiency is required in order to coordinate with overseas partners and stakeholders.