Rajeev Jain

Rajeev Jain

Rajeev Jain

Research Software Engineer · Argonne National Laboratory · University of Chicago

I build software tools for scientific research — climate data analysis, cancer drug response prediction, and high-performance computing. 16 years at Argonne, two R&D 100 Awards, and a passion for open-source scientific software.

Projects

Open-source tools I lead or contribute to — each section is directly shareable.

Project

UXarray

Python library for unstructured climate grid analysis

Challenge

Climate scientists working with unstructured grids (MPAS, ICON, SAM) lacked Python tools for conservative analysis that preserve integral quantities across non-uniform meshes.

My Role

Lead developer since project inception. Implemented core mathematical operators including conservative zonal averaging using Gauss-Legendre quadrature, Grid I/O readers for multiple formats (ESMF, MPAS, SCRIP, HEALPix), and testing infrastructure. Established continuous integration and regular PyPI releases.

Impact

Used by researchers at NCAR, DOE labs, and universities worldwide (205+ GitHub stars). Enables analysis of multi-petabyte climate datasets. Presented tutorials at SC24, AMS 2024, and ESDS Annual Event.

Project · R&D 100 Award 2023

CANDLE / IMPROVE

Hyperparameter optimization for cancer drug response models

Challenge

Cancer drug response prediction models showed poor generalization across different pharmacogenomic datasets, requiring systematic benchmarking and optimization.

My Role

Built hyperparameter optimization (HPO) infrastructure and ran 10,000+ training experiments across Summit, Theta, and Cori supercomputers. Developed GitHub Actions workflows for cross-study validation. Maintained benchmarking framework and co-authored standardization guidelines.

Impact

Benchmarking framework used by 15+ researchers across the project. Results published in Briefings in Bioinformatics (2025) and presented at 20th Workflows Workshop (2025). Contributed to R&D 100 Award (2023).

Project · R&D 100 Award 2022

FLASH-X

I/O optimization for exascale multiphysics simulations

Challenge

Checkpoint and restart operations were taking 30–50% of total runtime in billion-element FLASH-X simulations on leadership-class supercomputers.

My Role

Implemented asynchronous HDF5 I/O with Argobots for non-blocking checkpoint operations and integrated SZ3/ZFP compression. Built verification workflow with nightly baseline testing to ensure reproducibility. Enabled cross-checkpoint restart between AMReX and Paramesh solvers.

Impact

Achieved 40–70% reduction in checkpoint write times on Summit supercomputer. Compression reduced storage requirements by 50%+ with minimal accuracy loss. Published at SC24 workshop, contributed to R&D 100 Award (2022).

Recognition

2023
R&D 100 Award

CANDLE — Cancer Distributed Learning Environment for drug response prediction

2022
R&D 100 Award

FLASH-X — Multiphysics simulation software for exascale computing

2010
Best Paper Award

International Meshing Roundtable — Reactor core mesh generation

Background

2009 – present
Argonne National Laboratory — Principal Specialist, Research Software Engineering

Lead developer for UXarray, FLASH-X, CANDLE/IMPROVE, MeshKit, and urban simulation projects.

2023 – present
University of Chicago — Staff At-Large

Joint appointment supporting cancer and earth science research.

Education

M.S. Computer Science — University of Chicago (2020)
M.S. Structural Engineering — Arizona State University (2009)
B.Tech Mechanical Engineering — IIT ISM Dhanbad (2006)

Contact

Open to collaborations in scientific computing, AI for health, and open-source software.

rajeeja@gmail.com