Data Engineer Intern - Marketplace Intelligence & Data - Algorithm Data (Summer 2026)
About the Role
The Engineering and Technology team is at the core of Shopee's platform development. The team is made up of passionate engineers from all over the world, striving to build the best systems with the most suitable technologies. Our engineers do not merely solve the problems at hand; we build foundations for a long-lasting future. We don't limit ourselves to what we can or can't do; we take matters into our own hands, even if it means drilling down to the bottom layer of the computing platform. Shopee's rapidly growing business scale has transformed most "innocent" problems into huge technical challenges, and there is no better place to experience this first-hand if you love technology as much as we do.
Algorithm Data Engineering Team (Algo Data Team)
- Feature Platform & Feature Store Construction:
  - Lead or participate in the design, development, and maintenance of enterprise-level feature platforms / feature stores for both traditional models and LLMs. Address challenges such as online-offline feature consistency, real-time performance, and availability. Standardize and automate feature engineering pipelines to improve the efficiency of algorithm teams.
- High-Quality Dataset Construction and Maintenance:
  - Design and build high-performance, low-latency offline and real-time datasets for model training, evaluation, and online inference scenarios. This includes pre-training dataset construction, data filtering, data quality evaluation, data augmentation, and automated evaluation pipelines.
- Algorithm Experimentation and Monitoring Pipelines:
  - Participate in building and maintaining the core data pipelines for algorithm experiments, providing end-to-end support from data preparation, configuration, and execution monitoring to metric analysis and result interpretation.
- High-Value Label and Knowledge Graph Mining:
  - Leverage deep understanding of e-commerce business and algorithms to mine high-value user profiles, item labels, and relationship graphs from massive behavioral data, effectively feeding back into model optimization and business strategy.
Requirements:
- Currently pursuing a Bachelor’s degree in Computer Science, Artificial Intelligence, or related fields.
- Familiar with one or more big data technologies such as Spark, Flink, Hadoop, HBase, Kafka, Druid, ClickHouse.
- Excellent logical thinking, communication, project management, and cross-team coordination skills.
- Highly self-motivated, resilient under pressure, and eager to continuously explore and drive business breakthroughs.
Good to have:
- Experience with LLM pre-training data pipelines, Data Lake, Data Flywheel, or vLLM.
- Background in model evaluation (benchmarks) and model training (pre-training) is a strong plus.