@@ -231,7 +231,7 @@ gzip -d data/testing.warc.gz # windows gunzip no work-a
 compress it the wrong way
 gzip data/testing.warc

-showing the records in the compressed warc - note the offsets of request and response are
+showing the records in the compressed warc - note the offsets of request and response are identical
 java -jar jwarc-0.33.0.jar ls data/testing.warc.gz
  0 warcinfo - -
  3734 request GET https://an.wikipedia.org/wiki/Escopete
@@ -258,7 +258,7 @@ now let's do it the right way
 gzip -d data/testing.warc.gz
 mvn -q exec:java -Dexec.mainClass=org.commoncrawl.whirlwind.RecompressWARC -Dexec.args="data/testing.warc data/testing.warc.gz"

-showing the records in the compressed warc - note the skewed offsets of request and response
+showing the records in the compressed warc
 java -jar jwarc-0.33.0.jar ls data/testing.warc.gz
  0 warcinfo - -
  518 request GET https://an.wikipedia.org/wiki/Escopete
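
The difference between the two listings comes down to how the gzip stream is laid out. A plain `gzip` run writes the whole file as a single gzip member, while per-record compression writes one member per record, so each record has its own byte offset in the compressed file. The following is a minimal sketch of that idea, not the actual `RecompressWARC` implementation. The class name `PerRecordGzipSketch`, its methods, and the placeholder record strings are all hypothetical and use only `java.util.zip` from the standard library.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.util.Arrays;
import java.util.List;
import java.util.zip.GZIPInputStream;
import java.util.zip.GZIPOutputStream;

// Hypothetical illustration: one gzip member per record, with recorded offsets.
public class PerRecordGzipSketch {

    /** Compresses each record as its own gzip member and stores each member's start in offsets. */
    static byte[] compressPerRecord(List<byte[]> records, long[] offsets) throws IOException {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        for (int i = 0; i < records.size(); i++) {
            offsets[i] = out.size(); // compressed offset where this member begins
            // Closing the gzip stream ends the member; closing a
            // ByteArrayOutputStream has no effect, so `out` stays usable.
            try (GZIPOutputStream gz = new GZIPOutputStream(out)) {
                gz.write(records.get(i));
            }
        }
        return out.toByteArray();
    }

    /** Decompresses a single member given its compressed offset and length. */
    static byte[] readRecord(byte[] file, long offset, long length) throws IOException {
        try (GZIPInputStream gz = new GZIPInputStream(
                new ByteArrayInputStream(file, (int) offset, (int) length))) {
            return gz.readAllBytes();
        }
    }

    public static void main(String[] args) throws IOException {
        // Placeholder payloads standing in for real WARC records.
        List<byte[]> records = List.of(
                "warcinfo placeholder".getBytes(StandardCharsets.UTF_8),
                "request placeholder".getBytes(StandardCharsets.UTF_8),
                "response placeholder".getBytes(StandardCharsets.UTF_8));
        long[] offsets = new long[records.size()];
        byte[] file = compressPerRecord(records, offsets);

        // Read only the second record by slicing from its offset to the next one.
        byte[] second = readRecord(file, offsets[1], offsets[2] - offsets[1]);
        System.out.println(Arrays.toString(offsets));
        System.out.println(new String(second, StandardCharsets.UTF_8));
    }
}
```

Because each member is self-contained, a reader that knows a record's offset and length can decompress that record alone, without touching the rest of the file. With a single whole-file member that is not possible, because decompression has to start from the beginning.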