Perhaps I'm in the minority, but seeing open-source used in the description made me think you were using or providing an openly available LLM in addition to the chat/search features. Instead it seems this is "merely" (I don't mean to undermine the level of effort involved) using OpenAI's GPT-4 API for its LLM.
This sort of reek of a growth mindset where you are using "open-source" for the purposes of looking cool and gaining users, but you are in fact trying to grow as quickly as possible to prove to investors that they should fund you for your next round.
I have no reason to believe that's the case for you in particular; just letting you know that some people may perceive things that way. Maybe you could make it clearer that it is a GPT-4 frontend of sorts?
Hey activatedgeek! Thanks for sharing Khoj. @110 and I are the developers.
Lots of great discussion going on in this thread. Two things we want to clarify:
1. Search works offline. Chat uses OpenAI.
2. We're working on adding open source LLM support for chat. We're evaluating quality and ease of setup for this.
If you find the project interesting, hop on our Discord and share your thoughts: https://discord.gg/BDgyabRM6e.
We very much want to hear about your experiences and how we can make something more useful for the community.
At this point I'm surprised nobody connects these tools to Gmail, Gsuite, and/or a posix structure. If it has to be my self hosted AI assistant I should be able to provide my documents to it, right?
The example shown doesn't really fit what I associate with "personal assistant". Assistants do tasks, not answer questions like "where do good ideas come from?". I can ask that ChatGPT without any third-party middlemen.
Just had a look at the code. It’s a cool project that’s clearly had a lot of thought put into it.
If the devs are still around, I’d love to hear about your experiences with embeddings.
Here is a test to assess quality of these assistants.
(1) upload the bitcoin white-paper. (2) ask question “What is the contribution of R.C. Merkle to this reasearch?”
The proper answer should mention “Merkle Trees”.
Khoj means 'search' in some Indian languages
Nice work! I think one way I would definitely use it is if I can just ask questions about my downloads folder :) on my mac. If you are like me, you probably have papers, invoices, proof of addresses, passports and stuff like that inside. And would I be able to ask what's the passport number of ... so I can enter it into the web check in for a plane. Or if I need to know what my last electricity bill was ?
This uses ChatGPT, and the article makes no promise that our personal data will not be sent to ChatGPT.
No, thanks.
What is the difference to e.g. KnowledgeGPT?
https://news.ycombinator.com/item?id=34652921
I think i will have to test both solutions myself...
Hi there! To the developers:
Is there a way to use a personally owned and hosted LLM? If not, is there an interest in developing such a feature?
It is impossible for me To read this site on my iphone because the header size keeps changing with the typing animation so the text is moving up and down every second.
Would like to see this support Word documents also. Does not sound like those are as yet.
relies on ClosedAI, what's the point of being the 47373th app that does so?
Notion plug-in would be fantastic
- Seen a few of these. Are you all working on providing an easy way to maybe use LLMs for chatting/search without sending my data to OpenAI? If yes, how will you verify the quality is "reasonable"?
- How is this better than Rewind, Needl, Mem, etc all the personal search engine that have been doing the rounds lately from various knowledge bases? Is the selling point that it's Open-source? Also if Apple improves spotlight, I wonder how useful this will be.