Home

Awesome

Apache Tika for AWS Lambda as a layer

AWS Lambda layer containing the latest Apache Tika Server

Getting Started

Click on Layers and choose "Add a layer", and "Provide a layer version ARN" and enter the following ARN.

arn:aws:lambda:us-east-1:764866452798:layer:apache-tika:7

Current version: 2.3.0

How can I use it with the Lambda runtime which doesn't have Java installed?

The latest Apache Tika version requires Java 8 to be able to work.

If your runtime doesn't have Java installed, you can use Java 8 Lambda layer, which actually was designed to make Apache Tika work on Node.js 12.x runtime.

Where can I find Apache Tika .jar file inside of Lambda when I attached the layer?

You can find it at /opt/tika-server.jar (/opt is where Lambda unpacks layers).

Available regions

Update

  1. Go to https://tika.apache.org/download.html, download Apache Tika server runnable jar
  2. Rename the jar file from tika-server-x.xx.jar to tika-server.jar
  3. Create zip archive of Apache Tika server with the filename tika-server.zip
  4. Put zip archive into this repo
  5. Put proper version inside of publish.sh & README.md
  6. Increment layer versions in README.md
  7. Commit & Create Pull Request

License

MIT © Shelf