The Problem
Bridging the gap between monolithic application logic and the distributed requirements of big data platforms.
The Approach
Completed deep-dives into Hadoop HDFS for distributed storage, Spark for in-memory processing, and Spring Boot for microservice development. Focused on HDFS data replication and Spark RDD transformations.
Technical Stack
Challenges & Constraints
Outcome & Learnings
Gained specialized knowledge in high-concurrency systems which I now apply to CIBC's enterprise data platforms.