
Senior Software Developer (Data Engineer) - M/BDO4-PU
- Hong Kong
- Permanent
- Full-time
- Build and maintain robust data pipelines to integrate data collection, preprocessing, embedding, and retrieval processes. Ensure smooth and scalable operations across the system to support large-scale data handling and model integration.
- Collaborate with Al researchers and data scientists to understand data requirements and deliver high-quality datasets for model training and evaluation.
- Build and maintain data architectures
- Ensure data is pre-processed, cleaned, and transformed in preparation for machine learning models.
- Optimize data workflows to improve performance and reduce latency in Al model training and inference.
- Develop and maintain automated ETL (Extract, Transform, Load) processes to support data availability and consistency.
- Work with cloud platforms (AWS, GCP, Azure) to manage and scale data storage and processing systems.
- Ensure data governance, security, and compliance with data privacy regulations.
- Continuously monitor and improve the performance of data pipelines to meet the evolving needs of Al projects.
- Bachelor's or Master's degree in Computer Science, Data Engineering, or a related field.
- Proven experience as a Data Engineer or similar role, ideally with a focus on Al, machine learning, or big data.
- Strong programming skills in languages such as Python or Java
- Experience with data processing frameworks (e.g., Apache Spark, Hadoop) and real-time data streaming (e.g., Kafka)
- Familiarity with natural language processing techniques and tools (e.g., tokenization, embeddings, vectorization)
- Solid understanding of relational and NoSQL databases (e.g., MySQL, MongoDB)
- Experience with cloud services (AWS, GCP, Azure)
- Knowledge of data warehousing and data lake architectures.
- Familiarity with machine learning pipelines and Al tools (e.g., TensorFlow, PyTorch)
- Strong problem-solving and troubleshooting skills.
- Excellent communication skills and the ability to collaborate with cross-functional teams.