This release for safebooru.org covers IDs from 2.700.000 to 2.900.000 (11.2018 - 08.2019) and made in line with:
https://nyaa.si/view/1181364 (2.000.000-2.700.000 ,11.2016-11.2018) 350 GB also composite
https://nyaa.si/view/891391 (1.500.000-2.000.000 , 05.2015-11.2016) 142 GB safebooru only
https://nyaa.si/view/719463 (<1.500.000 till 05.2015) 395 GB safebooru only
Additional images from other booru's ( e-shuushuu.net gelbooru.com yande.re konachan.com anime-pictures.net )
not found on safebooru covers the same posting period.
**This imageset not intended to be "original and complete" but rather "the best of" and "representative"
to help users not to loose interesting fandom, artist or even single prominent picture**
File names structure : **%website% - %id% - %copyright% ~ %character% (%artist%).%ext%**
Images zipped according source and aspect ratio (dimensions2folders):
- “squares” 1x1 (+/- 20%)
- “pages” 3x4 (+/- 8%)
- "highpages" 2x3 (+/- 40%)
- “screens” 3x2 (+/- 40%)
Transformations and filters:
- Mpixels >= 1.2, width >= 900, height >= 900
- PNG converted to JPG
- comic, 4koma, overtexted, primitives cleared
- sometimes crop and gamma correction
- deduplicatied (with AntiDupl NET)
There is some ecchi (zip/folder name **q=questionable** from yande-re and gelbooru) but only censored one.
NOTE similar releases series for [2020](https://nyaa.si/view/1340980) and [2021](https://nyaa.si/view/1452049).
Also don't forget about other rips to get a full background for [BOORU-CHARS OPEN DATASET](https://nyaa.si/view/1384820)
- chan.sankakucomplex.com : https://nyaa.si/view/750972 (2014, 2015) and https://nyaa.si/view/875411 (2016)
- older e-shuushuu : https://www.acgnx.se/show-cceb3260269b5423cbd7f8d59f2c84531750923b.html (2016-2018), https://nyaa.si/view/771715 (2014-2015) and https://nyaa.si/view/513582 (before 2014)
- zerochan.net : https://rutracker.org/forum/viewtopic.php?t=5478026 (2017, russian tracker)
My rips are some sort of "summary" of all (or most) good quality character-centric art available via booru using reasonable resources.
Of course, "originals" are available for regrab (by ID, artist or any criteria), iqdb may be used for search etc.
I'm using this summary for some processing and analysis ( github.com/aperveyev/booru_processor ) and give everybody a chance to get the same stuff by one click.
Isn't it a point ?
Comments - 3
TGminer
AlexPUA (uploader)
SomaHeir