Awesome
New version: https://github.com/sightnet
This project will not be maintained.
==========
==========
<br> <br>Privacy Web Search Engine
Website
Features
Crawler
- Multithreading
- Cache
- Robots.txt
- Proxy
- Queue (BFS)
- Detect Trackers
- Http -> Https
Website / CLI
- Encryption (rsa)
- API
- Proxy
- Nodes
- Rating
Usage (Docker)
Please run the build every time to change the arguments. <br> The site is launched by default on port 8080 AND with tor proxy (<b>!!!</b>), to edit it you need to change config.json and rebuild website. <br> The api key for the database must be changed in the config and when the database is started(--api-key).
DB - please run before using other
sudo docker pull typesense/typesense:0.24.0.rcn6
mkdir /tmp/typesense-data
sudo docker run -p 8108:8108 -v/tmp/data:/data typesense/typesense:0.24.0.rcn6 --data-dir /data --api-key=xyz
Crawler
sudo docker-compose build crawler --build-arg SITES="$(cat sites.txt)" --build-arg THREADS=1 --build-arg CONFIG="$(cat config.json)"
sudo docker-compose up crawler
Website
sudo docker-compose build website --build-arg CONFIG="$(cat config.json)"
sudo docker-compose up website
Usage (Manual)
Deps
cd scripts && sh install_deps.sh
Build
cd scripts && sh build_all.sh
Run
The site is launched by default on port 8080 AND with tor proxy (<b>!!!</b>), to edit it you need to change config.json. <br> The api key for the database must be changed in the config and when the database is started(--api-key).
DB - please run before using other
mkdir /tmp/typesense-data &&
./typesense-server --data-dir=/tmp/typesense-data --api-key=xyz --enable-cors &&
sh scripts/init_db.sh
Crawler
./crawler ../../sites.txt 5 ../../config.json
#[sites_path] [threads_count] [config path]
Website
./website ../../config.json
#[config path]
Instances
¯\(ツ)/¯
TODO
- Docker
- Encryption (assymetric)
- Multithreading crawler
- Robots Rules (from headers & html) & crawl-delay
- Responsive web design
- Own FTS (...)
- Images Crawler
Dependencies
Config
./config.json
Mirrors
License
GNU Affero General Public License v3.0