This looks solid, being able to search across diagrams and videos is a big win. Curious how well it performs with noisy scanned PDFs or annotated images.
Thank you!! It works particularly well with those. We use ColPali-style embeddings for our visual doc search. As a result, we're not limited by parsing quality the same way typical RAG systems are. Here's a link to a blog I wrote on ColPali, for reference: https://docs.morphik.ai/concepts/colpali
This looks solid, being able to search across diagrams and videos is a big win. Curious how well it performs with noisy scanned PDFs or annotated images.
Thank you!! It works particularly well with those. We use ColPali-style embeddings for our visual doc search. As a result, we're not limited by parsing quality the same way typical RAG systems are. Here's a link to a blog I wrote on ColPali, for reference: https://docs.morphik.ai/concepts/colpali
Just adding some of our roadmap here as well:
- AST parsing and conversion to visual graphs for easier understanding of large codebases
- Integrate custom knowledge graph editing, parsing, and retrieval with Morphik MCP
- Slack, Jira, and Confluence integrations
Apologies for the poor formatting!
Features:
- Multimodal search across text, diagrams, and videos
- Natural language knowledge base management
- Fully open-source with responsive support
Great job with the core product. The MCP update isn't as interesting honestly.
Thanks for the feedback :) We're using the MCP daily as we develop Morphik further. Thought it would be a nice thing to share.
"I used the stones to [make] the stones" haha