BIG DATA MODELLING AND ANALYTICS
Implemented Steps to store, Query and process data stored in relational as well as non-relational databases. Worked with MySQL, PostGRE SQL and MongoDB, Neo4J.
Designed and Implemented Big Data Pipelines, Data Lakes, Hadoop HDFS with Map Reduce, Apache Spark on a VM Docker. I have been a part of two projects in the domain, working with US Storm data and Mobile Bike Sharing data. The projects involved research into the topics and datasets, comparision of the methods and tools available for the pipeline, designing the pipeline and conducting an exploratory analysis of the queried data.