Files
nextcloud-ocr/README.md
2021-09-25 12:58:39 -05:00

1.2 KiB

Nextcloud with OCR and Elasticsearch for Full-Text Searching

To get elasticsearch to run, you'll need to run this on the host: sysctl -w vm.max_map_count=262144 (add to /etc/sysctl.conf to make it persistent)

You can build and start all three containers (nextcloud, db and elasticsearch) with:

$docker-compose up

Basic Setup

  1. Go to: http://[host]:9900/
  2. Make up user/pass, I uncheck installing the additional apps at the bottom
  3. Login with user/pass you just created
  4. Disable dashboard under user > settings

OCR

  1. go into settings > basic settings > set cron
  2. install apps (workflow ocr)
  3. create flow:
    1. add new ocr flow
    2. when: file created
    3. file mime type > is > pdf document
    4. (USE IS AND NOT MATCHES!!!)

Full text

  1. Install apps (full text, full text elastic, full text - files)
  2. To to Settings > Full Text Search
  3. Configure settings appropriately. Address is: http://user:pass@elasticsearch:9200/
  4. Now you have to run the fulltextsearch index manually one time
    1. shell into container, enable www-data account (chsh)
    2. go to /var/www/html and run: php occ fulltextsearch:index
    3. disable www-data account (chsh back to nologin)