·· ExcellentData Analysis skills. Must be comfortable with querying and analyzing largeamount of data on Hadoop HDFS using Hive and Spark.
· Experiencewith Object oriented programming using python and its design patterns.
· Experiencehandling Unix systems, for optimal usage to host enterprise web applications.
· Expertin SQL and deep understanding of relational databases and Strong experience inperformance tuning SQL
· Strongunderstanding of Hadoop internals
· Expertin Hadoop: HDFS, Hive, MapReduce, HiveQL, Spark SQL
· Knowledgeof scripting languages – Python
· CreatingETL pipelines using SQL, Python, Hive, and Spark to populate data models