3 comments

2

This is awesome! I may just have to put this on an external drive for use down the line.

2

41 GB is a lot of data. Is there any way to pick out and download just a subset of it, say only the askubuntu data?

0

You could download it all and then delete the parts you don't want. The manage.sh script does that, with a command like:

./manage.sh manage_sites -d <SITE_KEY>

But honestly, I don't think you'd save much space that way. If you deleted everything except the stackoverflow.com data, I think you'd free only about 10 GB. Stack Overflow is much larger than all the others.

If you don't need Stack Overflow, you'd eliminate a ton of data, but in that case you're better off importing just the sites you want from scratch; the smaller sites take little time to import. Check out the instructions on the stackdump page on how to do that (or ask me, if you prefer).
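
If you do go the delete route and have several sites to remove, a small loop saves retyping the command. This is only a rough sketch: the one thing taken from above is the `./manage.sh manage_sites -d <SITE_KEY>` form, while the site keys listed and the `DRY_RUN` switch are placeholders I made up for illustration, so substitute the keys your install actually uses. By default it only prints the commands it would run.

```shell
#!/bin/sh
# Hypothetical site keys to delete; replace with the real keys from your install.
SITE_KEYS="cooking gaming"

# DRY_RUN=1 (the default) only prints the commands; set DRY_RUN=0 to run them.
DRY_RUN="${DRY_RUN:-1}"

delete_sites() {
    for key in "$@"; do
        if [ "$DRY_RUN" = "1" ]; then
            # Show what would be executed without touching any data.
            echo "./manage.sh manage_sites -d $key"
        else
            # Same command form as quoted in the comment above.
            ./manage.sh manage_sites -d "$key"
        fi
    done
}

# SITE_KEYS is left unquoted on purpose so it splits into one argument per key.
delete_sites $SITE_KEYS
```

Run it from the directory that contains manage.sh, check the printed commands, then rerun with `DRY_RUN=0` once they look right.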