New project See merge request !1
Writes all urls and ips to csvs on the hdfs, usage: spark-submit parse_warc.py START END 2> /dev/null, where START is the starting index and END is the ending index