Design and build scalable data processing infrastructure. Optimize performance, reliability, and cost efficiency. Work on core synchronization engine, connector framework, and monitoring systems. Ensure system can handle petabytes of data with high availability.