Comcast : Data Engineer 4-EB

Brief Description of position:

The Company

Founded in 1963, and headquartered in Philadelphia, Pennsylvania, Comcast Corporation (NASDAQ: CMCSA, CMCSK) is a global media and technology company with two primary businesses: NBCUniversal and Comcast Cable. NBCUniversal operates 30 news and entertainment cable networks, the NBC and Telemundo broadcast networks, television production operations, television station groups, Universal Pictures, and Universal Parks & Resorts.  Comcast Cable Communications, LLC (“Comcast nation’s largest video, high‑speed internet, and phone provider to residential and business customers under the XFINITY brand.  Comcast has invested in technology to build a sophisticated network that delivers the fastest broadband speeds, and brings Cable”) is the customers personalized video, communications, home management offerings, and business services. 




Website -

Senior Data Engineer, Software Development and Architecture

Comcast brings together the best in media and technology. We drive innovation to create the world’s best entertainment and online experiences. As a Fortune 40 leader, we set the pace in a variety of innovative and fascinating businesses and create career opportunities across a wide range of locations and disciplines. We are at the forefront of change and move at an amazing pace, thanks to our remarkable people, who bring cutting-edge products and services to life for millions of customers every day. If you share in our passion for teamwork, our vision to revolutionize industries and our goal to lead the future in media and technology, we want you to fast-forward your career at Comcast.


Job Summary:

We are actively seeking a diverse set of candidates to join our team! Billions of requests. Millions of Users. Petabytes of data. Come be part of Comcast's Enterprise Business Intelligence team! Our team crafts and builds highly performant software, data, and analytics solutions, low latency microservices, and operates application platforms that provide AI powered intelligence to various enterprise consumer platforms at Comcast both on-prem and in the cloud. Reliability and performance at this scale requires craftsmanship and sophistication. 
We are looking for an engineer who loves to code, understands technical requirements, collaborates on solutions, listens to users, and delivers technology solutions in a high velocity, dynamic, "always on" environment. Aside from software development in a fast-paced environment, performance tuning, platform optimizations, and automation is our passion. Our team values inclusiveness, collaboration, personal growth, and fun.

We are looking for engineers who truly enjoy coding and work hard to hone their craft.


  • Develop data driven solutions on big data in private and public cloud
  • Perform high volume data transformations on large volumes of datasets using massively parallel processing techniques
  • Deliver high throughput using various in memory compute and storage solutions
  • Modernize traditional ETL processes using in memory computing frameworks such as Spark (PySpark)
  • Deliver highly performant & reliable metadata driven, configurable solutions and workflows
  • Develop & Document Unit Tests and integrate regression testing suite in CICD pipelines
  • Manage and optimize various complex compute and ETL workflows to ensure reliable operations in production and non-production environments
  • Work with middleware services team, scrum team and other technical and non-technical teams to ensure the solutions designed/developed align with the overall product design and requirements
  • Constantly optimize the existing compute & ETL processes for efficiency and cost savings



  • Expertise with Python, PySpark, Scala, PyData, DataFrames, Jupyter Notebook and understands the fundamentals of functional programming language.
  • Excellent data modelling experience and understanding of core concepts of Big Data and has practical experience working with 100s of Terabytes to Petabytes of data
  • Solid experience and understanding of various core AWS services such as EC2, S3, EMR/Spark, Glue, SQS, Kinesis/Firehose, DynamoDB/DAX, Step functions, ElasticSearch, Athena and Redshift.
  • Hands on programming experience using AWS SDK (Boto3), Java SDK or CLI
  • Deep understand AWS Infrastructure, networking best practices, multi-region latency and fail over/disaster recovery strategy
  • Has practical experience transferring big data in and out of AWS, transforming and computing in AWS at scale
  • Well versed with devops practices and understands gitflow
  • Experience in developing distributed applications and dynamic workflows using serverless architecture
  • Analytical skills and experience in solving big data problems effectively using Spark/EMR
  • Experience with public cloud data sharing practices and security
  • Experience working with CICD pipelines
  • Bachelor’s Degree in Computer Science and Engineering

Desired Experience

  • Real time data streaming, batching and transformations on data in transit
  • Nice to have experience in infrastructure automation using Infrastructure as Code methodology and well versed with CloudFormation or Terraform
  • Data science Modeling, real time modeling frameworks



Disclaimer: The above information has been designed to indicate the general nature and level of work performed by employees in this role. It is not designed to contain or be interpreted as a comprehensive inventory of all duties, responsibilities and qualifications. Comcast is an EEO/AA/Drug Free Workplace. Comcast is an equal opportunity employer.

Minimum Qualification:
Maximum CTC (in lakhs per annum):
Mandatory SkillSet:
PySpark, Scala, AWS, Java, Python, ETL.


We believe in making Analytics Vidhya the best experience possible for Data Science enthusiasts. Help us by providing valuable Feedback.