{"id":9410,"date":"2025-12-23T10:04:32","date_gmt":"2025-12-23T10:04:32","guid":{"rendered":"https:\/\/serisec.com\/index.php\/2025\/12\/23\/spotify-music-library-with-86m-music-files-scraped-by-hacktivist-group\/"},"modified":"2025-12-23T10:04:32","modified_gmt":"2025-12-23T10:04:32","slug":"spotify-music-library-with-86m-music-files-scraped-by-hacktivist-group","status":"publish","type":"post","link":"https:\/\/serisec.com\/index.php\/2025\/12\/23\/spotify-music-library-with-86m-music-files-scraped-by-hacktivist-group\/","title":{"rendered":"Spotify Music Library With 86M Music Files Scraped by Hacktivist Group"},"content":{"rendered":"<p>    Spotify Music Library With 86M Music Files Scraped by Hacktivist Group<br \/>\n \t<BR><br \/>\n<BR><\/BR><br \/>\n    <!-- no image --><br \/>\n \t<BR><br \/>\n<BR><\/BR><\/p>\n<div>\n<p>The shadow library known as Anna\u2019s Archive has executed a massive scrape of Spotify, releasing a torrent collection containing approximately 86 million audio tracks and <a href=\"https:\/\/cybersecuritynews.com\/hackers-exploiting-ec2-instance-metadata-vulnerability\/\" target=\"_blank\" rel=\"noreferrer noopener\">metadata<\/a> for 256 million songs.<\/p>\n<p>The group, which typically focuses on archiving academic papers and books, claims this <a href=\"https:\/\/cybersecuritynews.com\/mobilegestalt-exploit-ios-26-0-1\/\" target=\"_blank\" rel=\"noreferrer noopener\">unauthorized<\/a> acquisition is the world\u2019s first open \u201cpreservation archive\u201d for music.<\/p>\n<p>The total collection weighs in at nearly 300 terabytes. According to the group, the dump includes the most extensive public music metadata database.<\/p>\n<p>Covering an estimated 99.9% of Spotify\u2019s catalog and representing 99.6% of all streams on the platform.<\/p>\n<figure class=\"wp-block-image size-large\"><img data-recalc-dims=\"1\" decoding=\"async\" src=\"https:\/\/i0.wp.com\/blogger.googleusercontent.com\/img\/b\/R29vZ2xl\/AVvXsEijldZEbf5sWe2TUS-zDUE7vD1g5t4o6ejWpk6fvbuTrS_1ONgp9VoN0KL3zcgRWXJQNTGRvbgh88EKuUUgmjJTit0xQZ6MJuv-8HIO4S-EjJW3pMcj1VPGMm51BAB7VookOPzbHxv8B2SghDCdDcIufgvyB0tiauU76BFL1NceAB_GCOg-nyk8wXWBZN4\/s1600\/Screenshot%25202025-12-23%2520115440%2520%25281%2529.webp?ssl=1\" alt=\"Duplicates track count per ISRC\"><figcaption class=\"wp-element-caption\">Duplicates track count per ISRC<\/figcaption><\/figure>\n<p>In a blog post detailing the release, the group admitted they \u201cdiscovered a way to scrape Spotify at scale.\u201d<\/p>\n<figure class=\"wp-block-image size-large\"><img data-recalc-dims=\"1\" decoding=\"async\" src=\"https:\/\/i0.wp.com\/blogger.googleusercontent.com\/img\/b\/R29vZ2xl\/AVvXsEi2gxRe3OmwLPWtqi-cdRxaLl36fcM7CmHOKJcXi-rOdFKNEOVNZlgLJ8GUeOZs71_MCfhjqG7CF5Vwr0GgFsO4F9jECoeWtC_Vd0ntEPTWNatgL8J4zUl5Bod99cLyS9jsYfYaBIHYIcmQvypxznAOYPEgGDBUAeTKUHzy-sHmuJh4JBlGWHgx1nwVfN8\/s1600\/Screenshot%25202025-12-23%2520115211%2520%25281%2529.webp?ssl=1\" alt=\"Duplicate album count per UPC\"><figcaption class=\"wp-element-caption\">Duplicate album count per UPC<\/figcaption><\/figure>\n<p>They argue that current music archiving efforts are insufficient because they focus too heavily on high-quality audiophile formats (such as lossless FLAC) or on only popular artists.<\/p>\n<p>This leaves the \u201clong tail\u201d of obscure music <a href=\"https:\/\/cybersecuritynews.com\/fortiweb-authentication-vulnerability-exploited\/\" target=\"_blank\" rel=\"noreferrer noopener\">vulnerable<\/a> to being lost. \u201cOur mission (preserving humanity\u2019s knowledge and culture) doesn\u2019t distinguish among media types,\u201d the group stated.<\/p>\n<p>\u201cSometimes an opportunity comes along outside of text. This is such a case.\u201d To manage the massive file size, the group prioritized quality based on Spotify\u2019s popularity metric.<\/p>\n<p>The most popular songs were archived in their original OGG Vorbis format at 160kbit\/s. However, tracks with a popularity score of zero were re-encoded to OGG Opus at lower bitrates to save space.<\/p>\n<p>A trade-off the group deemed necessary to achieve \u201call music humanity has ever produced.\u201d<\/p>\n<p>The data is being released in stages via BitTorrent. The metadata was released first, followed by the music files, in order of popularity.<\/p>\n<p>The group is explicitly asking the public to seed these torrents to protect the collection against \u201cnatural disasters, wars, and budget cuts.\u201d<\/p>\n<p><span style=\"box-sizing: border-box; margin: 0px; padding: 0px;\">While<a href=\"https:\/\/annas-archive.li\/blog\/backing-up-spotify.html\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">\u00a0Anna\u2019s Archive<\/a>\u00a0frames this as a cultural preservation project, the scrape represents a significant\u00a0breach\u00a0of Spotify\u2019s terms of service.<\/span> It involves the mass distribution of copyrighted material.<\/p>\n<p class=\"has-text-align-center has-background\" style=\"background:linear-gradient(180deg,rgb(238,238,238) 94%,rgb(169,184,195) 100%)\"><strong>Follow us on <a href=\"https:\/\/news.google.com\/publications\/CAAqMggKIixDQklTR3dnTWFoY0tGV041WW1WeWMyVmpkWEpwZEhsdVpYZHpMbU52YlNnQVAB?hl=en-IN&amp;gl=IN&amp;ceid=IN:en\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">Google News<\/a>, <a href=\"https:\/\/www.linkedin.com\/company\/cybersecurity-news\/\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">LinkedIn<\/a>, and <a href=\"https:\/\/x.com\/cyber_press_org\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">X<\/a> for daily cybersecurity updates. <a href=\"https:\/\/cybersecuritynews.com\/contact-us\/\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">Contact us<\/a> to feature your stories.<\/strong><\/p>\n<p>The post <a href=\"https:\/\/cybersecuritynews.com\/spotify-music-library-scraped\/\">Spotify Music Library With 86M Music Files Scraped by Hacktivist Group<\/a> appeared first on <a href=\"https:\/\/cybersecuritynews.com\/\">Cyber Security News<\/a>.<\/p>\n<\/div>\n<p> \t<BR><br \/>\n <BR><\/BR><br \/>\n    Abinaya<br \/>\n \t<BR><br \/>\n<BR><\/BR><br \/>\n<a href=\"https:\/\/cybersecuritynews.com\/spotify-music-library-scraped\/\">Go to cyber-security-news<\/a><br \/>\n \t<BR><br \/>\n <BR><\/BR><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Spotify Music Library With 86M Music Files Scraped by Hacktivist Group The shadow library known as Anna\u2019s Archive has executed a massive scrape of Spotify, releasing a torrent collection containing approximately 86 million audio tracks and metadata for 256 million songs. The group, which typically focuses on archiving academic papers and books, claims this unauthorized [&hellip;]<\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[129,63,156],"tags":[130],"class_list":["post-9410","post","type-post","status-publish","format-standard","hentry","category-cyber-security","category-cyber-security-news","category-data-breach","tag-cyber-security-news"],"_links":{"self":[{"href":"https:\/\/serisec.com\/index.php\/wp-json\/wp\/v2\/posts\/9410"}],"collection":[{"href":"https:\/\/serisec.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/serisec.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/serisec.com\/index.php\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/serisec.com\/index.php\/wp-json\/wp\/v2\/comments?post=9410"}],"version-history":[{"count":0,"href":"https:\/\/serisec.com\/index.php\/wp-json\/wp\/v2\/posts\/9410\/revisions"}],"wp:attachment":[{"href":"https:\/\/serisec.com\/index.php\/wp-json\/wp\/v2\/media?parent=9410"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/serisec.com\/index.php\/wp-json\/wp\/v2\/categories?post=9410"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/serisec.com\/index.php\/wp-json\/wp\/v2\/tags?post=9410"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}