Awesome
JWAT-Tools
JWAT-Tools is an extension to the JWAT utility libraries.
This project currently includes a commandline tool with various gzip/arc/warc/xml tasks.
Tasks
- Arc2Warc
- CDX
- Changed
- Compress
- ContainerMD
- Decompress
- Delete
- Extract
- Interval
- PathIndex
- Test
- Unpack
Downloads
Releases and snapshot can also be download by following these links.
History
V0.6.3-SNAPSHOT
- Added verify code to ARC/WARC compress task.
- Compress file writes old/new filename/length/md5 to system.out for now.
V0.6.2
- Updated dependency to JWAT-1.0.4.
V0.6.1
- Thomas LEDOUX: ContainerMDUtils class - function encodeContent() replaced by new function which is faster (about 10 times) and takes care of all the protected characters (even the control ones).
- Added extract code for WARC and GZip. Unit tests to follow soon.
- Added version to manifest. Show version in manifest instead of a constant string.
- Removed some JVM options in the start scripts.
- Moved common classes from JWAT-Tools to JWAT.
V0.6.0
- It is tedious not having deployed this artifact to maven central. So this and future versions will be deployed.
- Fixed serious NPE in PayloadManager for large files.
- Moved some common classes from JWAT-Tools to JWAT. (They will disappear in the new snapshot version)
- Changed to used JWAT-1.0.3
Maven
jar dependency.
<dependency>
<groupId>org.jwat</groupId>
<artifactId>jwat-tools</artifactId>
<version>0.6.0</version>
</dependency>
The following 2 artifacts should be executable and with all required dependencies included, when unpacked.
<dependency>
<groupId>org.jwat</groupId>
<artifactId>jwat-tools</artifactId>
<version>0.6.0</version>
<type>tar.gz</type>
</dependency>
or
<dependency>
<groupId>org.jwat</groupId>
<artifactId>jwat-tools</artifactId>
<version>0.6.0</version>
<type>zip</type>
</dependency>