Shga Sample 750k.tar.gz Link
The file name itself follows standard Linux archiving conventions:
: Journalists from the New York Times and The Wall Street Journal contacted individuals listed in the sample and confirmed that the details, including names, addresses, and police records, were accurate. shga sample 750k.tar.gz
In late June 2022, "ChinaDan" posted a listing offering the full SHGA database for (roughly $200,000 at the time). To prove the data was legitimate, the hacker provided the shga_sample_750k.tar.gz file, which contained approximately 750,000 records divided into three main indices (250,000 records each). The file name itself follows standard Linux archiving
: Denoting the number of records included in the sample. : Denoting the number of records included in the sample
The file, originally uploaded to the now-defunct "Breach Forums" by a user named served as a proof-of-concept to verify the authenticity of a massive 23-terabyte dataset allegedly containing the personal information of 1 billion Chinese citizens . Origin and Significance of the 750k Sample
: Records included individuals from across China, not just Shanghai, covering roughly 7.4% of China's total population . Technical Specifications of the File