Role Overview: We are seeking a skilled Data Engineer to join our team and play a crucial role in the development and maintenance of our recommendation system. As a Data Engineer, you will be responsible for designing and implementing the data infrastructure that powers the system, ensuring the seamless flow of data and the availability of high-quality information to drive personalized content recommendations.

Key Responsibilities:

  1. Design and implement a scalable, fault-tolerant, and highly available data pipeline to capture, process, and store user engagement data in real-time.
  2. Develop efficient data storage solutions, including the collisionless embedding table, to effectively represent and retrieve user data and content metadata.
  3. Optimize data processing and transformation workflows to enable the continuous training and adaptation of the recommendation model.
  4. Ensure the reliability, performance, and scalability of the data infrastructure to handle the growing volume and velocity of user interactions.
  5. Collaborate with the machine learning engineering team to understand their data requirements and provide the necessary data products to support the development and deployment of the recommendation system.
  6. Implement robust data monitoring, alerting, and troubleshooting mechanisms to maintain the overall health and reliability of the data ecosystem.
  7. Continuously explore and evaluate new data technologies, tools, and techniques to enhance the efficiency and capabilities of the data infrastructure.
  8. Document data pipelines, processes, and best practices to maintain system transparency and enable cross-team knowledge sharing.

Required Qualifications:

Desired Skills: