Case Study

Big Data & DevOps Explorations

A series of hands-on lab investigations into distributed computing architectures and service orchestration.

The Problem

Monolithic application logic does not map cleanly onto the distributed requirements of big data platforms. These labs set out to bridge that gap.

The Approach

Completed deep dives into Hadoop HDFS for distributed storage, Apache Spark for in-memory processing, and Spring Boot for microservice development, with a particular focus on HDFS data replication and Spark RDD transformations.
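
To make the replication focus concrete, here is a minimal sketch of the storage arithmetic behind HDFS replication. It is not code from the labs themselves. The function name hdfs_footprint is hypothetical, and the 128 MiB block size and replication factor of 3 are assumed to match the stock Hadoop defaults (dfs.blocksize and dfs.replication); a real cluster may configure either one differently.

```python
# Assumed values: stock Hadoop defaults. A real cluster may override both.
BLOCK_SIZE_BYTES = 128 * 1024 * 1024   # dfs.blocksize
REPLICATION_FACTOR = 3                 # dfs.replication


def hdfs_footprint(file_size_bytes,
                   block_size=BLOCK_SIZE_BYTES,
                   replication=REPLICATION_FACTOR):
    """Estimate how one file is laid out in HDFS (hypothetical helper).

    Returns a tuple (blocks, block_replicas, raw_bytes):
      blocks          number of logical blocks the file is split into
      block_replicas  total block copies stored across DataNodes
      raw_bytes       raw disk space consumed across the cluster
    """
    if file_size_bytes < 0 or block_size <= 0 or replication <= 0:
        raise ValueError("sizes and replication must be positive")

    # Integer ceiling division: a trailing partial block still counts as a block.
    blocks = (file_size_bytes + block_size - 1) // block_size

    # Every logical block is stored `replication` times.
    block_replicas = blocks * replication

    # The last block is not padded to the full block size, so raw usage
    # is the file size times the replication factor.
    raw_bytes = file_size_bytes * replication

    return blocks, block_replicas, raw_bytes


if __name__ == "__main__":
    one_gib = 1024 ** 3
    print(hdfs_footprint(one_gib))
```

Under these assumptions, a 1 GiB file splits into 8 blocks, is stored as 24 block replicas, and consumes 3 GiB of raw cluster storage. That threefold cost is the trade-off for tolerating the loss of individual DataNodes.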

Technical Stack

Apache Spark, Hadoop, Spring Boot, Distributed Systems, HDFS, Microservices

Challenges & Constraints

Outcome & Learnings

Gained specialized knowledge of high-concurrency distributed systems, which I now apply to CIBC's enterprise data platforms.