Distributed Control System Specialist
Job Posting: Distributed Systems Specialist – Synthetic Data Infrastructure
Location: Remote (Open to all time zones)
Type: Full-time, Contract (1 year with potential for extension)
Start Date: ASAP
Work Hours: 40 hours/week, Monday–Friday
Travel: None required
About the Role
We are looking for a Distributed Systems Specialist to join our Synthetic Data Infrastructure team. This role is ideal for a generalist engineer with deep expertise in distributed systems who thrives in a fast-paced, collaborative environment. You will be instrumental in designing, developing, and optimizing large-scale data generation pipelines that support machine learning workflows.
Key Responsibilities
• Design and optimize data processing pipelines using DAGs/Graphs and distributed workers/jobs.
• Develop scalable solutions to handle large fan-out loads.
• Collaborate with cross-functional teams to identify and prioritize system improvements.
• Work closely with data scientists and engineers to integrate and scale solutions.
• Continuously identify and implement performance optimizations to reduce resource utilization.
• Contribute to scheduling systems and support pipeline submissions.
• Participate in on-call rotations and provide support for production systems.
Required Qualifications
• 5+ years of experience in software development, with a strong focus on distributed systems.
• Proven experience with large fan-out loads and distributed job execution.
• Proficiency in at least one of the following languages:
• Python (preferred)
• Rust
• C++
• Hack (PHP)
• Experience with cloud infrastructure (AWS or GCP).
• Strong problem-solving skills and ability to work independently.
• Excellent communication and collaboration skills.
• Bachelor’s degree in a technical field.
Preferred Qualifications
• Experience running compute clusters or building data infrastructure.
• Background in companies operating at large data scales.
• Familiarity with Digital Content Creation (DCC) tools like Blender.
• Experience with machine learning frameworks such as TensorFlow or PyTorch.
What Makes This Role Exciting
This is a unique opportunity to work on cutting-edge infrastructure that powers large-scale compute workloads. You’ll take ownership of critical systems, collaborate with a talented team, and make a tangible impact on performance, scalability, and reliability. If you enjoy solving complex problems and optimizing systems at scale, this role is for you.
Interview Process
• 2 Rounds Total:
• 1 Technical Coding Interview (45 minutes)
• 1 Behavioral Interview (30 minutes)
Apply Job!
Apply to this Job