Hi, I'm Diggy aka
Digvijay Ghotane

a Data Scientist,

With a deep-seated passion for data, a keen aptitude for numerical analysis, and a seamless ability to engage with technology, I possess the power to orchestrate what many perceive as magic.

I specialize in data-driven strategies tailored to improve decision-making for both individuals and organizations, using rigorous statistical analysis to drive impactful outcomes.

Key Work Areas

Data-Driven Services

Business Strategy

• Precise Revenue Forecasting
• Strategic Target Setting
• Advanced Reporting Enhancement
• Customer & Inventory Management

Machine Learning

• Large Language Model (LLM) Classifiers
• Breakout Analysis for Public Equities
• Customer Behavioral Analysis
• OCR for Documents

Project Development

• Data Visualization & Dashboards
• A/B Testing
• Hypothesis Testing
• Statistical Modeling


Georgetown University

McCourt School of Public Policy

Master of Science, Data Science for Public Policy
August 2019 - May 2021

• Relevant Coursework:
• Microeconomics
• Statistics I & II
• Data Science I, II & III
• Data Visualization & Geographical Information Systems
• Massive Data Fundamentals
• Computational Linguistics

• Received the merit-based Graduate School Financial Aid Award.

• Managed the financial operations of the South Asian Policy and Research Initiative as Treasurer (2019-21).

• Advocated for student concerns and needs, serving as a liaison between students and faculty as GradGov Senator (2020-21).

Mumbai University

Bachelor of Engineering, Electronics and Telecommunication
August 2015 - May 2019

• Relevant Coursework:
• Applied Mathematics I, II, III & IV
• Structured & Object Oriented Programming
• Analog and Digital Electronics (I & II)
• Signals and Control Systems
• Wave Theory, RF Modeling, and Sat. Comm.
• Microprocessors & Microcontrollers
• Integrated Circuits & VLSI Design
• Computer Networks

• Inducted into the Hall of Fame (2018-19) for extracurricular excellence.

• Organized, participated in, and judged over 50 Model UN conferences across India (2015-2019).

• SYNC, Computer Society of India: Mumbai Chapter (2015-16): Designed and built a custom Arduino-based automated medication dispenser, operated from a smartphone over Bluetooth, for precise and reliable dosage delivery.

• The Management Society - TSEC (2016-17): As Debate Coordinator, strategized and executed 3 major student conferences involving over 1,000 participants in total.

Professional Experience

DIA Associates

August 2021 - September 2023
New York, NY, USA

Collaborated with cross-functional client teams at a Fortune 500 multinational financial services corporation specializing in payment cards to drive profitable business decisions and improve internal capabilities, supporting its Global Commercial Services division (United States region):

• Boosted overall customer spend targets by ~30% YoY by statistically analyzing KPIs and building a robust forecasting model.

• Enhanced visibility on reporting requirements by 100% by enabling weekly/monthly reporting on:
• Performance of Business Development Leaders
• Sources of revenue, tracked and consolidated via cross-segmentation across cross-functional teams
• The impact of Covid-19 and its trailing effects on various business segments

• Improved management of ~10% of customer accounts by analyzing Rolling-12 customer-cycle spending profit margins.

• Optimized resource allocation by strategizing the deployment of efforts toward replicating historically successful deals.

• Built an LLM classifier model with ~85% accuracy to understand actual customer engagement from ad-hoc tasks for reporting purposes.


Biocomplexity Institute, University of Virginia

Data Science for Public Good Fellow
May 2021 - August 2021
Arlington, VA, USA

• Led a team of 2 interns to classify innovation from news sources using LLMs to reduce the costs of surveys for the National Science Foundation.
• The summer project focused on improving innovation measurement for the National Science Foundation using advanced Natural Language Processing techniques. The aim was to supplement traditional survey data with alternative methods; the team built a pipeline that identifies innovation in text sources and extracts relevant company information. This approach identifies innovation in near real time, offering a more dynamic perspective than traditional survey methods.

• Built a comprehensive dashboard for presenting Arlington County Park Access Equity Data in collaboration with the Arlington government.
• In collaboration with Arlington County, Virginia, conducted an equity-focused study on park provision, analyzing demographic factors and travel accessibility to assess alignment with community needs.


Georgetown University

Graduate Research Assistant
Security Studies Program, Edmund A. Walsh School of Foreign Service
November 2019 - May 2021
Washington, DC, USA

• Graduate Research Assistant for Professor C. Christine Fair.

• Co-authored, with Prof. C. Christine Fair and Prof. Parina Patel, an academic paper published in the journal Small Wars & Insurgencies in May 2021: Did India's demonetization policy curb stone-pelting in Indian-administered Kashmir?

• Co-authored an article about the paper for Gateway House: Indian Council on Global Relations in June 2021: Linking demonetization and stone-pelting

• Created a novel time-series dataset of stone-pelting events in Indian-administered Jammu and Kashmir (J&K), with 608 downloads from the Harvard Dataverse as of March 2024.

• Tabulated household survey data from Pakistan in STATA to study the positive correlation between education and income, as part of a separate project.


The World Bank

Data Publication Intern
Development Impact (DIME)
February 2021 - May 2021
Washington, DC, USA

Cleaned, wrangled, aggregated, and translated (from Portuguese to English) country-governance-related survey data (3 datasets, ~3,200 variables) in STATA, preparing it for statistical analysis of irrigation-related development in an African country.

The World Resources Institute

Data Science Intern
Climate Team
June 2020 - August 2020
Washington, DC, USA

Compiled disparate sources of data and produced a new normalized dataset to evaluate and quantify water use of ~500 thermoelectric plants across North America & Europe.


Soft Skills

Microsoft Office Suite
Languages: English, Marathi & Hindi

Hard Skills

SQL and PySpark
Tableau & PowerBI
Web Development: HTML, CSS, JavaScript
