# Nextcloud with OCR and Elasticsearch for Full-Text Searching

To get Elasticsearch to run, you'll need to raise the kernel's memory-map limit on the host (as root):

    sysctl -w vm.max_map_count=262144

To make the setting persistent across reboots, add the line `vm.max_map_count=262144` to /etc/sysctl.conf.

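
The two host steps above could be scripted along the following lines. This is only a minimal sketch and not part of this repo: the names `REQUIRED`, `needs_raise`, `persist_setting` and `ensure_max_map_count` are made up for illustration, and it assumes a host where `sysctl -n` is available and /etc/sysctl.conf is read at boot.

```shell
#!/bin/sh
# Sketch only: all function and variable names here are illustrative.
REQUIRED=262144

# True (exit 0) when the current value ($1) is below the required one ($2).
needs_raise() {
    [ "$1" -lt "$2" ]
}

# Append the setting to the given sysctl config file unless an entry for
# vm.max_map_count is already present (an existing entry is left untouched,
# even if its value is lower).
persist_setting() {
    conf=$1
    grep -q '^vm\.max_map_count' "$conf" 2>/dev/null ||
        echo "vm.max_map_count=$REQUIRED" >> "$conf"
}

# Apply the limit now and persist it. Needs root, so it is only defined
# here and never called automatically.
ensure_max_map_count() {
    if needs_raise "$(sysctl -n vm.max_map_count)" "$REQUIRED"; then
        sysctl -w vm.max_map_count="$REQUIRED"
    fi
    persist_setting /etc/sysctl.conf
}
```

After sourcing the file as root, calling `ensure_max_map_count` once before starting the containers should be enough.
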

You can build and start all three containers (nextcloud, db and elasticsearch) with:

    $ docker-compose up

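
For longer-running setups it may be more convenient to start the stack in the background and then check on it. The following is a small sketch, assuming the compose file in this repo defines a service named `elasticsearch` (the name is taken from the container list above and has not been verified). `COMPOSE` and `start_stack` are illustrative names.

```shell
#!/bin/sh
# Override COMPOSE (e.g. COMPOSE="docker compose") if you use the Compose v2 plugin.
COMPOSE=${COMPOSE:-docker-compose}

start_stack() {
    $COMPOSE up -d --build                  # build the images, start in the background
    $COMPOSE ps                             # list the containers and their state
    $COMPOSE logs --tail=20 elasticsearch   # last lines of the Elasticsearch log (assumed service name)
}
```

Call `start_stack` from the directory that contains the compose file. If the Elasticsearch container exits right away, its log is the first place to look for a complaint about `vm.max_map_count`.
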
## Basic Setup

1. Go to: http://[host]:9900/
2. Choose an admin username and password. I uncheck the option at the bottom to install the additional apps.
3. Log in with the user/pass you just created.
4. Disable the dashboard under user > settings.

## OCR

1. Go into Settings > Basic settings and set background jobs to Cron.
2. Install the Workflow OCR app.
3. Create a flow:
   1. Add a new OCR flow.
   2. When: File created.
   3. File MIME type > is > PDF document.
   4. (Use "is", NOT "matches"!)

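
Steps 1 and 2 can probably also be done from the command line with `occ` instead of the web UI. The following is a sketch under several assumptions: the compose service is called `nextcloud`, Nextcloud lives in /var/www/html, and the app id is `workflow_ocr` (the id used in the Nextcloud app store). `COMPOSE`, `occ` and `setup_ocr` are illustrative shell names, not something this repo provides. The flow itself (step 3) still has to be created in the web UI.

```shell
#!/bin/sh
COMPOSE=${COMPOSE:-docker-compose}

# Run an occ command inside the (assumed) nextcloud service as www-data.
occ() {
    $COMPOSE exec -u www-data nextcloud php /var/www/html/occ "$@"
}

setup_ocr() {
    occ background:cron            # switch background jobs to Cron mode
    occ app:install workflow_ocr   # install the Workflow OCR app (assumed app id)
}
```

Note that `background:cron` only switches the mode. Something inside the container still has to run Nextcloud's cron.php on a schedule.
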
## Full text

1. Install the apps (Full text search, Full text search - Elasticsearch Platform, Full text search - Files).
2. Go to Settings > Full Text Search.
3. Configure the settings appropriately. The address is: http://user:pass@elasticsearch:9200/
4. Now you have to run the fulltextsearch index manually one time:
   1. Shell into the container and enable the www-data account (chsh).
   2. Go to /var/www/html and run: php occ fulltextsearch:index
   3. Disable the www-data account again (chsh back to nologin).

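
The chsh round trip in step 4 can likely be avoided by letting Docker run the command as `www-data` directly, since `exec -u` starts the process without going through the account's login shell. The following is a sketch under assumptions: the compose service is called `nextcloud`, and the app ids are `fulltextsearch`, `fulltextsearch_elasticsearch` and `files_fulltextsearch` (the ids used in the Nextcloud app store). `COMPOSE`, `occ`, `install_fulltext_apps` and `run_first_index` are illustrative names.

```shell
#!/bin/sh
COMPOSE=${COMPOSE:-docker-compose}

# Run an occ command inside the (assumed) nextcloud service as www-data.
occ() {
    $COMPOSE exec -u www-data nextcloud php /var/www/html/occ "$@"
}

# Step 1: install the three full-text search apps (assumed app ids).
install_fulltext_apps() {
    for app in fulltextsearch fulltextsearch_elasticsearch files_fulltextsearch; do
        occ app:install "$app"
    done
}

# Step 4: build the index once, after the settings page has been filled in.
run_first_index() {
    occ fulltextsearch:index
}
```

Run `install_fulltext_apps` first, configure the Elasticsearch address in the web UI (steps 2 and 3), then run `run_first_index`.
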