Awesome
File System Crawler for Elasticsearch
Welcome to the FS Crawler for Elasticsearch
This crawler helps to index binary documents such as PDF, Open Office, MS Office.
Main features:
- Local file system (or a mounted drive) crawling and index new files, update existing ones and removes old ones.
- Remote file system over SSH/FTP crawling.
- REST interface to let you "upload" your binary documents to elasticsearch.
Latest versions
Current "most stable" versions are:
Elasticsearch | FS Crawler | Released | Docs |
---|---|---|---|
6.x, 7.x, 8.x | 2.10-SNAPSHOT | 2.10-SNAPSHOT |
Build and Quality Status
GitHub stats
Documentation
The guide has been moved to ReadTheDocs.
Contribute
Works on my machine - and yours ! Spin up pre-configured, standardized dev environments of this repository, by clicking on the button below.
License
Read more about the Apache2 License.
Thanks
Thanks to JetBrains for the IntelliJ IDEA License!
Thanks to SonarCloud for the free analysis!