ABOUT ME

Accomplished Data Scientist and Data Engineer with a demonstrated history of using Python and core data science libraries (Pandas, NumPy, SciPy, Scikit-Learn, and TensorFlow) for data manipulation, analysis, and model development. Skilled in designing and implementing robust data pipelines with PySpark, SQL, and cloud-native tools such as Azure Databricks and Azure Data Factory to support large-scale data processing. Proficient in building visualizations, dashboards, and presentations with Power BI and Matplotlib that convey complex insights to diverse audiences and support strategic decision-making. Experienced in building and optimizing ETL workflows and in applying the Medallion architecture (Bronze, Silver, and Gold layers) for efficient data management and governance. Versed in all stages of the data lifecycle, from defining data requirements and orchestrating ingestion processes to deploying predictive models and analytical solutions that yield actionable business insights. Committed to delivering high-quality systems by continually refining workflows and methodologies. Capable of researching and validating new approaches and algorithms through prototyping, and of implementing data lineage and governance best practices with Databricks Unity Catalog, Delta Lake, and audit metadata. Adept at collaborating with cross-functional teams to deliver scalable, secure, and future-ready data platforms.

CERTIFICATIONS