Replies: 1 comment
-
The error comes from a hardcoded 5-second timeout on the batch embedding step in RAGFlow. This is common when processing large txt or PDF files, or when the embedding model (like BAAI/bge-large-zh-v1.5) is slow or resource-constrained. You can address this in a few ways:
If your input text is very long, make sure it doesn't exceed the model's token limit (e.g., 512 tokens for bge-large-zh-v1.5), and truncate or split it if needed [source]. If embedding is still too slow, consider using an external embedding service (like vLLM, Ollama, or Xinference) [source]. Finally, check that the task executor service is healthy, and restart it if tasks are getting stuck [source].
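As a rough illustration of the first suggestion, here is a minimal sketch that splits a long text into pieces before embedding. It is not RAGFlow code: the function name `split_into_chunks` and the `max_chars` default are hypothetical, and character count is only a rough stand-in for token count (for Chinese text with a BERT-style tokenizer it is usually close; for exact limits you would count tokens with the model's own tokenizer).

```python
from typing import List


def split_into_chunks(text: str, max_chars: int = 500) -> List[str]:
    """Split text into consecutive pieces of at most max_chars characters.

    max_chars=500 is an assumed margin below the 512-token limit quoted
    above; it is a character budget, not a true token count.
    """
    if max_chars <= 0:
        raise ValueError("max_chars must be positive")
    # Step through the text in fixed-size windows; the last piece may be shorter.
    return [text[i:i + max_chars] for i in range(0, len(text), max_chars)]


if __name__ == "__main__":
    sample = "示例文本" * 400  # 1600 characters of placeholder text
    pieces = split_into_chunks(sample)
    print([len(p) for p in pieces])
```

Each piece can then be sent to the embedding model separately, which keeps every request under the limit and also shortens individual calls that might otherwise hit the timeout.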
-
The txt document parsing error is as follows: [ERROR]Generate embedding error:Function 'batch_encode' timed out after 5 seconds and 2 attempts. See the log screenshot for details.
