Rajeev Jain

Rajeev Jain

Rajeev Jain

Research software engineer building high-performance tools for climate science, cancer research, and exascale computing. Lead developer of UXarray (205+ GitHub stars), 2× R&D 100 Award winner, with 16+ years optimizing scientific workflows at Argonne National Laboratory.

Principal Specialist · Argonne MCS Division · Joint appointment at University of Chicago

Featured Work

UXarray: Python library for unstructured climate grid analysis

Challenge: Climate scientists working with unstructured grids (MPAS, ICON, SAM) lacked Python tools for conservative analysis that preserve integral quantities across non-uniform meshes.

Contribution: Lead developer since project inception, implementing core mathematical operators including conservative zonal averaging using Gauss-Legendre quadrature, Grid I/O readers for multiple formats (ESMF, MPAS, SCRIP, HEALPix), and testing infrastructure. Established continuous integration and regular PyPI releases.

Impact: UXarray is now used by researchers at NCAR, DOE labs, and universities worldwide (205+ GitHub stars). Enables analysis of multi-petabyte climate datasets with validated accuracy. Presented tutorials at SC24, AMS 2024, and ESDS Annual Event.

FLASH-X: I/O optimization for exascale multiphysics simulations

Challenge: Checkpoint and restart operations were taking 30-50% of total runtime in billion-element FLASH-X simulations on leadership-class supercomputers.

Contribution: Implemented asynchronous HDF5 I/O with Argobots for non-blocking checkpoint operations and integrated SZ3/ZFP compression. Built verification workflow with nightly baseline testing to ensure reproducibility. Enabled cross-checkpoint restart between AMReX and Paramesh solvers.

Impact: Achieved 40-70% reduction in checkpoint write times on Summit supercomputer. Compression reduced storage requirements by 50%+ with minimal accuracy loss. Published at SC24 workshop, contributed to R&D 100 Award (2022).

CANDLE/IMPROVE: hyperparameter optimization for cancer drug response models

Challenge: Cancer drug response prediction models showed poor generalization across different pharmacogenomic datasets, requiring systematic benchmarking and optimization.

Contribution: Built hyperparameter optimization (HPO) infrastructure and ran 10,000+ training experiments across Summit, Theta, and Cori supercomputers. Developed GitHub Actions workflows for cross-study validation. Maintained benchmarking framework and co-authored standardization guidelines.

Impact: Benchmarking framework used by 15+ researchers across the project. Results published in Briefings in Bioinformatics (2025) and presented at 20th Workflows Workshop (2025). Contributed to R&D 100 Award (2023).

Recognition

  • R&D 100 Award 2023: CANDLE — Cancer Distributed Learning Environment for drug response prediction
  • R&D 100 Award 2022: FLASH-X — Multiphysics simulation software for exascale computing
  • Best Paper Award: International Meshing Roundtable (2010) — Reactor core mesh generation

Background

Argonne National Laboratory (2009–present) — Principal Specialist, Research Software Engineering. Lead developer for UXarray, FLASH-X, CANDLE/IMPROVE, MeshKit, and urban simulation projects.

University of Chicago (2023–present) — Staff At-Large. Joint appointment supporting cancer and earth science research.

Education: M.S. Computer Science (University of Chicago, 2020) · M.S. Structural Engineering (Arizona State University, 2009) · Arizona State University Graduate Fellowship (2007-2009)

Contact

rajeeja@gmail.com