The Socioeconomic High-resolution Rural-Urban Geographic Platform for India (SHRUG) facilitates data sharing between researchers working on India. It is an open access repository currently comprising dozens of datasets covering India's over 500,000 villages and 8000 towns over a span of 25 years, all linked together with a set of common geographic identifiers.
Our goal is to make as many Indian datasets as possible speak to each other at the highest achievable granularity. Browse our data, use as much as you want, and contribute back to this effort to empower everyone to use data for the greater good.
Prior to the SHRUG, linking different Indian datasets was a hassle. The common geographic frame of SHRUG now makes it easy to share information and link data. Researchers can immediately tap into SHRUG to mine a wealth of previously unavailable socioeconomic data for the geographies that they are working on, and can then publish their own data back to SHRUG, making it available to researchers working on other topics.
The SHRUG is an easily linkable dataset covering a wide range of socioeconomic variables in India. Some highlights:
The essential unit of observation of the shrug is
the shrid
, which is a village or town
unit with consistent boundaries since 1991. We put in
a ton of work to reconcile boundary changes across
censuses. Villages and towns merge and split; shrids
are aggregated units that keep the same boundaries
across all periods. You can read more about how we did
this in the SHRUG paper.
The variables are grouped into different modules
based on the data source and subject matter, and
each module can be independently downloaded from
links provided below. Each download link contains
flat data tables at all available geographic
levels (e.g. village, town, constituency). You can
merge at any geographic level you wish using the
appropriate key in the core keys
module. Many
SHRUG variables can be interactively visualized
using the SHRUG Atlas.
If you are using previous versions, we recommend you download version 2.0 or newer. v.1.5 Samosa shrid
IDs do not match to 2.0 Pakora, and many other improvements have been made.
For more information, please see:
SHRUG data is shared using the Creative Commons BY-NC-SA 4.0 International License for non-commercial use. You may use the data for your private research, on condition that you share your own data with SHRUG identifiers when your research paper is published. This makes it easier for others to find your data and cite your work. For inquiries into using the SHRUG for commercial purposes, please contact our team.
For commercial use of the SHRUG, as well as custom data generation requests, please contact our team for capabilities and pricing.