
= Nextcloud with OCR and Elasticsearch for Full-Text Searching

To get Elasticsearch to run, raise the kernel's mmap limit on the host first: sysctl -w vm.max_map_count=262144 (add vm.max_map_count=262144 to /etc/sysctl.conf to make it persistent)
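
As a rough sketch, the check below reads the current value and reports whether it still needs raising. The needs_raise helper is a name made up for this example, and it assumes a Linux host where /proc/sys/vm/max_map_count is readable.

```shell
#!/bin/sh
# Minimum value this README asks for.
REQUIRED=262144

# needs_raise CURRENT: succeeds (exit 0) when CURRENT is below REQUIRED.
# Hypothetical helper, not part of this repository.
needs_raise() {
    [ "$1" -lt "$REQUIRED" ]
}

# Read the live kernel setting; fall back to 0 if /proc is unavailable.
current=$(cat /proc/sys/vm/max_map_count 2>/dev/null || echo 0)

if needs_raise "$current"; then
    echo "vm.max_map_count is $current; run: sysctl -w vm.max_map_count=$REQUIRED"
else
    echo "vm.max_map_count is $current; OK"
fi
```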

You can build and start all three containers (nextcloud, db and elasticsearch) with:

$ docker-compose up
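
If you start the stack in the background with docker-compose up -d instead, something like the following could wait until Nextcloud answers before you move on. This is only a sketch: wait_for, TRIES and DELAY are names invented here, and the URL in the usage note assumes port 9900 is reachable on localhost, as in the Basic Setup link.

```shell
#!/bin/sh
# Hypothetical retry settings; override them from the environment if needed.
TRIES="${TRIES:-30}"
DELAY="${DELAY:-2}"

# wait_for CMD...: retry CMD until it succeeds, at most TRIES times,
# sleeping DELAY seconds after each failed attempt.
wait_for() {
    i=0
    while [ "$i" -lt "$TRIES" ]; do
        if "$@" >/dev/null 2>&1; then
            return 0
        fi
        i=$((i + 1))
        sleep "$DELAY"
    done
    return 1
}
```

Possible usage: docker-compose up -d && wait_for curl -fs http://localhost:9900/status.php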

== Basic Setup

  1. Go to: http://[host]:9900/
  2. Make up a user/pass for the admin account. I uncheck the option at the bottom to install the additional apps
  3. Log in with the user/pass you just created
  4. Disable the dashboard under user > settings

== OCR

  1. Go into settings > basic settings and set background jobs to cron
  2. Install the app (workflow ocr)
  3. Create a flow:
    1. Add a new ocr flow
    2. When: file created
    3. File mime type > is > pdf document
    4. (Use "is" and NOT "matches"!)

== Full text

  1. Install apps (full text, full text elastic, full text - files)
  2. Go to Settings > Full Text Search
  3. Configure settings appropriately. Address is: http://user:pass@elasticsearch:9200/
  4. Run the fulltextsearch index manually one time:
    1. Shell into the container and enable the www-data account (chsh)
    2. Go to /var/www/html and run: php occ fulltextsearch:index
    3. Disable the www-data account again (chsh back to nologin)
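
The chsh steps might be avoidable: docker-compose exec can run a command as a given user, so a wrapper like the one below may work without touching the www-data shell. It is a sketch, not something this repository ships. The occ function and the COMPOSE and SERVICE variables are made up here, and the service name nextcloud is an assumption based on the container list at the top; check your docker-compose.yml.

```shell
#!/bin/sh
# Assumption: the Nextcloud service is called "nextcloud" in docker-compose.yml.
SERVICE="${SERVICE:-nextcloud}"
# Command used to talk to Compose; overridable for testing.
COMPOSE="${COMPOSE:-docker-compose}"

# occ ARGS...: run Nextcloud's occ inside the container as www-data.
# Uses the full path so it does not depend on the container's working directory.
occ() {
    $COMPOSE exec -u www-data "$SERVICE" php /var/www/html/occ "$@"
}
```

With the function loaded, the one-time index would be: occ fulltextsearch:index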