Software Engineer, Data Infrastructure & Acquisition
This position is listed on behalf of a partner company, which manages all applications and next steps. Our partner is looking for a Software Engineer, Data Infrastructure & Acquisition, based in Germany.
This role sits at the intersection of software engineering, data infrastructure, and applied AI, focusing on building and scaling the systems that power large-scale dataset acquisition for next-generation machine learning models. You will work in a fully distributed environment alongside engineers, researchers, and product leaders to design robust ingestion pipelines capable of handling massive, high-quality audio and text datasets.
Accountabilities
You will be responsible for building, maintaining, and scaling large-scale data ingestion and acquisition systems that support AI model training and product development. You will design and extend cloud-based infrastructure, optimize data pipelines, and ensure efficient processing of high-volume datasets across distributed systems.
- Build and maintain scalable data ingestion and processing pipelines
- Extend cloud infrastructure (GCP) using Infrastructure-as-Code tools
- Identify and integrate new data sources into acquisition systems
- Collaborate with research and AI teams to improve dataset quality and efficiency
- Optimize systems for cost, throughput, and reliability at scale
- Contribute to architecture and roadmap decisions for data infrastructure
Requirements
The ideal candidate brings strong software engineering experience with a focus on distributed systems, data infrastructure, or backend engineering in production environments.
- 5 years of software engineering experience
- Strong proficiency in Python and Linux environments (bash scripting)
- Experience with GCP and Infrastructure-as-Code (Terraform preferred)
- Hands-on experience with Docker and cloud-native development
- Exposure to large-scale data pipelines or web crawling systems (preferred)
- Strong problem-solving and system design skills
- Excellent communication and cross-functional collaboration abilities
- Degree in Computer Science or related technical field (BS/MS/PhD)
Benefits
- Competitive base salary with bonus and equity opportunities
- Fully remote, distributed-first work environment
- High-impact role working on AI systems used at global scale
- Opportunity to shape foundational data infrastructure for ML models
- Collaborative, engineering-driven culture with strong autonomy
- Access to cutting-edge AI and data engineering technologies
How Jobgether Works
We use an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. Our system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company.
Data Privacy Notice
By submitting your application, you acknowledge that Jobgether will process your personal data to evaluate your candidacy and share relevant information with the hiring employer. This processing is based on legitimate interest and pre-contractual measures under applicable data protection laws (including GDPR).
