docs(search): add note about re-indexing when enabling Tika by michaelstingl · Pull Request #2285 · opencloud-eu/opencloud

Description

Add notes to the search service README clarifying that:

  1. Enabling Tika does not automatically re-extract content from files that are already indexed.
  2. The opencloud search index --all-spaces command skips files whose modification time is unchanged.
  3. Workaround: delete the Bleve search index before re-indexing to force full content extraction.

Related Issue

Motivation and Context

When users enable Tika on an existing instance, they expect full-text search to work for all files. However, opencloud search index --all-spaces skips any file that is already in the index and whose mtime has not changed (mtime-based check in services/search/pkg/search/service.go), so the Tika extractor is never called for previously indexed files. Until now this behavior was undocumented, which makes the missing full-text results confusing.

How Has This Been Tested?

  • test environment: Read the source code in services/search/pkg/search/service.go (IndexSpace method, mtime skip logic at line ~495)
  • test case 1: Verified the skip behavior by tracing the code path: IndexSpace → Walk → mtime check → skip if already indexed
  • test case 2: Confirmed no --force flag exists in the CLI or protobuf definition (IndexSpaceRequest)

Types of changes

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to change)
  • Technical debt
  • Tests only (no source changes)

Checklist:

  • Code changes
  • Unit tests added
  • Acceptance tests added
  • Documentation added