Ross Girshick (rbg)
aspiring climbing bum

email  /  arXiv  /  Google Scholar

Research

I'm interested in algorithms for visual perception (object recognition, localization, segmentation, pose estimation, ...), representation learning (pre-training networks using strong supervision, weak supervision, or no supervision at all), and the interaction of vision and language. My work explores topics in computer vision and machine/deep/statistical learning.

About me / bio

Ross Girshick is an influential AI researcher with over 500,000 citations, 100 papers, and 6 patents. He's well known for inventing the R-CNN computer vision algorithm for object detection, which reshaped the field with deep learning techniques in 2013, and for authoring Detectron, widely used open-source software for object detection. Ross has received numerous awards from top conferences and professional associations, including the PAMI Young Researcher Award (2017), the PAMI Mark Everingham Prize three times (2017, 2021, and 2023) for his contributions to open-source software and datasets, and several 10-year test-of-time awards (the 2024 Longuet-Higgins Prize for R-CNN, the 2025 Helmholtz Prize for Fast R-CNN, and the NeurIPS 2025 test-of-time award for Faster R-CNN). Ross has a PhD from the University of Chicago, was a postdoc at UC Berkeley, and spent 10 years as a research scientist in top industry and non-profit labs: Microsoft Research (2014-2015), FAIR (Facebook/Meta, 2015-2023), and the Allen Institute for AI (2023-2024). In late 2024 he co-founded Vercept, which was acquired by Anthropic in early 2026. Ross currently works at Anthropic researching various random things.

Publications and tech reports on Google Scholar


Journal reviewing note: Please do not invite me to review unless you have asked me via a personal message beforehand (though I will most likely decline). I receive many unsolicited requests per week, which I simply delete without reading due to the volume.


Erdős number = 3 (via two paths)