docs(search): add note about re-indexing when enabling Tika by michaelstingl · Pull Request #2285 · opencloud-eu/opencloud
Description
Add notes to the search service README clarifying that:
- Enabling Tika does not automatically re-extract content from already indexed files
- The
opencloud search index --all-spacescommand skips files with unchanged modification time - Workaround: delete the Bleve search index before re-indexing to force full content extraction
Related Issue
Motivation and Context
When users enable Tika on an existing instance, they expect full-text search to work for all files. However, opencloud search index --all-spaces skips files already in the index (mtime-based check in services/search/pkg/search/service.go), so the Tika extractor is never called for previously indexed files. This is undocumented and confusing.
How Has This Been Tested?
- test environment: Read the source code in
services/search/pkg/search/service.go(IndexSpace method, mtime skip logic at line ~495) - test case 1: Verified the skip behavior by tracing the code path: IndexSpace → Walk → mtime check → skip if already indexed
- test case 2: Confirmed no
--forceflag exists in the CLI or protobuf definition (IndexSpaceRequest)
Types of changes
- Bug fix (non-breaking change which fixes an issue)
- New feature (non-breaking change which adds functionality)
- Breaking change (fix or feature that would cause existing functionality to change)
- Technical debt
- Tests only (no source changes)
Checklist:
- Code changes
- Unit tests added
- Acceptance tests added
- Documentation added