Data Engineer
Full-Time / Office / Gangnam
Key Responsibilities
- Design and build data pipelines and infrastructure for collecting and processing video data required for AI model training
- Design and implement large-scale databases to efficiently manage video data and preprocessing workflows
- Develop systems for version control and cataloging of existing datasets and labels
- Build pipelines that enable the ML Engineering team to easily access and train on video datasets at scale
Requirements
- 3+ years of experience in software engineering (server/backend) or equivalent experience
- 1+ year of experience working with cloud platforms such as AWS, Azure, or GCP, or equivalent knowledge
- Experience with web data analysis and web crawling
- Hands-on experience with large-scale data processing and pipeline systems (e.g., BigQuery, S3-like object storage)
- Strong understanding of AI/ML concepts and hands-on experience training AI/ML models
- Passion for and openness to learning new technologies
- Proven ability to collaborate and deliver results as part of a team
Preferred Qualifications
- Proficiency in Python or Go
- Familiarity with large-scale distributed processing tools (e.g., Hadoop, Spark)
- Direct involvement in creating datasets for training AI models
- Knowledge of video data workflows and processing techniques
- Hands-on development and deployment experience in containerized environments (e.g., Docker, Kubernetes)