Skip to content

feat: add search_indexes parameter to ProcessChunks and ProcessFileEmbedding (#856)#1171

Open
pyramation wants to merge 1 commit into
mainfrom
feat/process-chunks-search-indexes
Open

feat: add search_indexes parameter to ProcessChunks and ProcessFileEmbedding (#856)#1171
pyramation wants to merge 1 commit into
mainfrom
feat/process-chunks-search-indexes

Conversation

@pyramation
Copy link
Copy Markdown
Contributor

Summary

Adds search_indexes parameter to the ProcessChunks node type definition and the ProcessFileEmbedding chunks sub-config in the node type registry.

search_indexes: {
  type: 'array',
  items: { type: 'string', enum: ['fulltext', 'bm25', 'trigram'] },
  default: ['fulltext']
}

This enables hybrid RAG workflows — keyword search (tsvector/BM25/trigram) alongside vector similarity search on the chunks table's content column.

The default [\"fulltext\"] is centralized here in the parameter_schema and applied by table_module normalization before reaching the SQL generator.

Review & Testing Checklist for Human

  • Verify search_indexes property matches the SQL column type (jsonb array of strings)
  • Confirm default ['fulltext'] aligns with the SQL column default '[\"fulltext\"]'::jsonb

Notes

  • Companion to constructive-io/constructive-db#1164 (SQL implementation)
  • Closes constructive-io/constructive-planning#856 (together with the DB PR)

Link to Devin session: https://app.devin.ai/sessions/2b5a29d83d3f478e8d3d972653b4879c
Requested by: @pyramation

…bedding (#856)

Adds search_indexes to ProcessChunks parameter_schema and ProcessFileEmbedding's
chunks sub-config with default ['fulltext']. Enables hybrid RAG by opting into
fulltext (tsvector), bm25, or trigram search on the chunks content column.

Companion to constructive-io/constructive-db#1164
@devin-ai-integration
Copy link
Copy Markdown
Contributor

🤖 Devin AI Engineer

I'll be helping with this pull request! Here's what you should know:

✅ I will automatically:

  • Address comments on this PR. Add '(aside)' to your comment to have me ignore it.
  • Look at CI failures and help fix them

Note: I can only respond to comments from users who have write access to this repository.

⚙️ Control Options:

  • Disable automatic comment and CI monitoring

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant