Nextcloud with OCR and Elasticsearch for Full-Text Searching
To get elasticsearch to run, you'll need to run this on the host: sysctl -w vm.max_map_count=262144 (add to /etc/sysctl.conf to make it persistent)
You can build and start all three containers (nextcloud, db and elasticsearch) with:
$ docker-compose up
Basic Setup
- Go to: http://[host]:9900/
- Make up user/pass, I uncheck installing the additional apps at the bottom
- Login with user/pass you just created
- Disable dashboard under user > settings
OCR
- go into settings > basic settings > set cron
- install apps (workflow ocr)
- create flow:
- add new ocr flow
- when: file created
- file mime type > is > pdf document
- (USE IS AND NOT MATCHES!!!)
Full text
- Install apps (full text, full text elastic, full text - files)
- To to Settings > Full Text Search
- Configure settings appropriately. Address is: http://user:pass@elasticsearch:9200/
- Now you have to run the fulltextsearch index manually one time
- shell into container, enable www-data account (chsh)
- go to /var/www/html and run: php occ fulltextsearch:index
- disable www-data account (chsh back to nologin)
Description
Nextcloud docker with elasticsearch and functionality for OCR workflow and fulltext search.
Languages
Dockerfile
100%