Independent Data & AI Consultant

Hi, I'm Diggy

Digvijay Ghotane

a Data Scientist

I build the pipelines that move data, the models that learn from it, and the analysis that turns it into decisions.

Digvijay Ghotane
01 / 10 Now Aug 2024 — Present

Team Lead — AI Evaluation & Agent Benchmarking

Leading a team of data scientists that builds and runs the data and evaluation infrastructure behind frontier AI and agentic systems — designing the pipelines that turn model-interaction data into structured benchmarks, and the quality controls that keep large-scale evaluation trustworthy. Engaged as an independent contractor.

02 / 10 Professional Experience Aug 2021 — Sep 2023

DIA Associates

Analyst · New York, NY

Embedded with the Global Commercial Services division of a Fortune-500 payments company as a management-consulting analyst:

• Built ML forecasting models (regression, decision trees, random forest) that lifted customer-spend targets ~30% YoY.

• Developed an SVM classifier (~90% accuracy) to gauge genuine customer engagement from managers' ad-hoc reporting.

• Automated divisional reporting into a single-source-of-truth data pipeline — customer-data visibility up 100%.

03 / 10 Research & Fellowships May 2021 — Aug 2021

UVA · Biocomplexity Institute

Data Science for Public Good Fellow · Arlington, VA

• Led 2 interns building an NLP pipeline (BERT/RoBERTa models) to classify innovation from news for the National Science Foundation. Learn more.

• Built an Arlington County park-access equity dashboard with the county government. Learn more.

Rewind — this fellowship's last day was Friday, Aug 6, 2021; DIA began the following Monday, Aug 9. The research years start here.

04 / 10 Research & Fellowships Feb 2021 — May 2021

The World Bank

Data Publication Intern, DIME · Washington, DC

Cleaned, wrangled, and translated (Portuguese → English) country-governance survey data — 3 datasets, ~3,200 variables — in STATA, for analysis of irrigation-led development in Mozambique.

05 / 10 Research & Fellowships Feb 2021 — May 2021

Georgetown · Massive Data Institute

Washington, DC

Built an AWS Lambda OCR pipeline parsing DC housing-eviction court records into a structured dataset, supporting analysis of the socio-economic factors behind DC evictions.

06 / 10 Research & Fellowships Aug 2020 — May 2021

Georgetown University

Teaching Assistant — Intro to Data Science (Prof. Brodnax) · Washington, DC

Supported instruction and mentored students for the graduate Introduction to Data Science course.

07 / 10 Research & Fellowships Jun 2020 — Aug 2020

The World Resources Institute

Data Science Intern, Climate · Washington, DC

Compiled disparate sources into a normalized dataset quantifying water use of ~500 thermoelectric plants across North America & Europe.

08 / 10 Research & Fellowships Nov 2019 — May 2021

Georgetown University

Graduate Research Assistant to Prof. C. Christine Fair · Washington, DC

• Co-authored a peer-reviewed paper in Small Wars & Insurgencies (2021): Did India's demonetization policy curb stone-pelting in Indian-administered Kashmir?

• Companion article at Gateway House (2021): Linking demonetization and stone-pelting.

• Created a stone-pelting time-series dataset for J&K — 608 Harvard Dataverse downloads.

• Tabulated Pakistani household-survey data in STATA (education vs. income).

09 / 10 Education Aug 2019 — May 2021

Georgetown University

MS, Data Science for Public Policy

McCourt School of Public Policy. Coursework spanning statistics, data science I–III, data visualization & GIS, massive data, and computational linguistics — with the merit-based Graduate School Financial Aid Award.

The communication & leadership thread continued here: teaching assistant (above), SAPRI Treasurer, and GradGov Senator.

10 / 10 Education Aug 2015 — May 2019

Mumbai University

BE, Electronics & Telecommunication

Where the communication & leadership roots took hold — ~50 Model UN conferences and two years (2017–2019) as a freelance communication-skills trainer at SkillSphere Education.

Hall of Fame (2018–19); built a Bluetooth-controlled Arduino medication dispenser; Debate Coordinator running 1,000+-participant conferences.

What I work with

Competencies

Languages & Tools

  • Python
  • SQL
  • R
  • PySpark
  • Hive
  • GCP
  • Tableau
  • Power BI
  • STATA

Methods & Techniques

  • AI / LLM evaluation
  • Agent benchmarking
  • Eval infrastructure
  • Data pipelines / ETL
  • Machine learning
  • NLP (BERT/RoBERTa)
  • OCR pipelines
  • Statistical modeling
  • Hypothesis testing
  • A/B testing
  • Data visualization
Spoken Languages English · Marathi · Hindi
Working Style Leadership · Collaboration · Memos & documentation
Get in Touch with me

Let's Work Together

Digvijay Ghotane

Digvijay Ghotane

I take on select consulting engagements in data and AI evaluation. Email me directly or use the form.

Email: digvijay.ghotane@gmail.com
Thank you for your message!
I will get back to you as soon as possible.