2022 - SHGA Shanghai Gov National Police database

In late June 2022, "ChinaDan" posted a listing offering the full SHGA database for roughly $200,000 at the time. To prove the data was legitimate, the hacker provided the shga_sample_750k.tar.gz file, which contained approximately 750,000 records divided into three main indices (250,000 records each).

Technical Specifications of the File

The file name itself, shga_sample_750k.tar.gz, follows standard Linux archiving conventions:
shga: Stands for "Shanghai Gov" or "Shanghai Public Security Bureau" (Gongan Ju).
sample_750k: Denotes the number of records included in the sample.
.tar.gz: A compressed archive format commonly used for large data transfers.

Contents of the Dataset

Personal information: Full names, national ID numbers (resident identity cards), mobile phone numbers, birthplaces, and birthdates.
Police records: Detailed case reports and criminal records, ranging from minor traffic violations to major criminal investigations.
Scope: Records included individuals from across China, not just Shanghai, covering roughly 7.4% of China's total population.

Cybersecurity and Geopolitical Impact

Cause: Security experts, including Binance CEO Changpeng Zhao, suggested the leak occurred due to a misconfigured Elasticsearch database that was left exposed on the internet without a password.

By February 2025, researchers at SpyCloud reported that re-circulated copies of this dataset were still being traded in underground markets, with recent versions containing nearly 960 million rows of data.