Registered
Prizes
Apache Spark is an open-source and distributed processing system used for big data workloads. It utilizes in-memory caching, and optimized query execution for fast analytic queries against data of any size. It provides development APIs in multiple programming languages like Java, Scala, Python and R, and also supports code reuse across multiple workloads.
In this DataHour, Akshay will be providing an overview of Apache Spark and its capabilities as a distributed computing system. Additionally, we will delve into internal data processing using Spark and explore techniques for performance tuning of Spark jobs. Also, this session aims to cover the concepts of parallel computing and how they relate to working with big data in Spark. This DataHour is for students and professionals looking to gain a deeper understanding of big data processing using Spark.
Prerequisites: Passion to learn data science, and an eagerness to take on new challenges.
Akshay Chauhan
Lead Data Engineer at Royal Bank of Scotland Business
Akshay is a Data Engineering Leader at Royal Bank of Scotland Business with experience in various industries like- Telecom, Public Service, Finance, Internet etc. He also possesses over a decade of experience in data engineering at multiple renowned companies like Accenture, Sapient and Royan Bank of Scotland. He is currently a research scholar at IIM Lucknow and has authored various research papers and also has served as a technical reviewer in many books.
Connect with him on Linkedin
Please register/login to participate in the contest
Please register to participate in the contest
Please register to participate in the contest