Setting up an LLM using OpenAI or similar on my own datasets

Hi team

I’ve put this question in general as I’m not sure it fits the profile of the AI topic.

I guess I'm wondering if anyone has set up an end-to-end solution for a user-facing LLM.

I’d love to set up a web app that gives my users responses to questions based on a dataset I’ve created, i.e. a whole heap of PDF files will be searched and a response will be given to the user based on the available knowledge and the user’s question.

OpenAI suggests that I need to set up a vector database or similar to hold my PDFs etc.
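For anyone wondering what a “vector database” amounts to at its simplest: it’s a store of (chunk, embedding) pairs that you search by similarity. Here’s a minimal Python sketch with hand-made toy vectors — in a real app the embeddings would come from an embedding model (e.g. OpenAI’s embedding endpoint), and the function names here are just illustrative:

```python
import math

def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

def nearest_chunks(query_vec, store, top_k=3):
    """Return the top_k chunk texts whose embeddings are closest to query_vec.
    `store` is a list of (chunk_text, embedding) pairs."""
    ranked = sorted(store, key=lambda item: cosine_similarity(query_vec, item[1]),
                    reverse=True)
    return [text for text, _ in ranked[:top_k]]

# Toy 3-d "embeddings" purely for illustration; real embeddings have
# hundreds or thousands of dimensions.
store = [
    ("PDF chunk about nutrition", [1.0, 0.1, 0.0]),
    ("PDF chunk about exercise",  [0.0, 1.0, 0.1]),
    ("PDF chunk about sleep",     [0.1, 0.0, 1.0]),
]
print(nearest_chunks([0.9, 0.2, 0.0], store, top_k=1))
# → ['PDF chunk about nutrition']
```

The retrieved chunks are then pasted into the prompt so the model answers only from your content — that’s the whole “RAG” trick. Dedicated vector databases just do this lookup faster at scale.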

Has anyone done something similar?

I just completed such a project. I’m traveling this week but will try to make up a mock example project for you later in the week. Ping me Wednesday/Thursday if I don’t respond by then

Instead of PDF files, I had all my content in a database table. The first part breaks each topic into smaller chunks. The second part embeds these chunks into a vector format. The third part is sending the query to OpenAI for the response using only my content
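The three parts described above can be sketched roughly like this in Python (hypothetical function names, not the actual Xojo code from the project). Step 1, the chunking, needs no API at all; overlapping the chunks helps retrieval when an answer straddles a boundary:

```python
def chunk_text(text, chunk_size=200, overlap=40):
    """Step 1: split a long document into overlapping word chunks."""
    words = text.split()
    chunks = []
    step = chunk_size - overlap
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + chunk_size]))
        if start + chunk_size >= len(words):
            break
    return chunks

# Steps 2 and 3 would then call the OpenAI API, roughly along the lines of:
#   embedding = client.embeddings.create(model="text-embedding-3-small",
#                                        input=chunk)
#   answer = client.chat.completions.create(model="gpt-4", messages=[
#       {"role": "system", "content": "Answer only from: " + retrieved_text},
#       {"role": "user", "content": question}])

doc = " ".join(f"word{i}" for i in range(500))
chunks = chunk_text(doc, chunk_size=200, overlap=40)
print(len(chunks))  # → 3
```

Chunk sizes are a tuning knob: too small and you lose context, too large and you waste prompt tokens on irrelevant text.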

Also, my project is a desktop project, but I’m sure it could work for web with minimal changes

2 Likes

I have the sample project attached, which should hopefully guide you in the right direction in using Chat referencing only your content. The winMainInfo screen has several instructions. Of most importance is to add your own ChatGPT API Key in the placeholder constant (AIAPIKey) in Module1

I included several comments in the code for you to change to make this fit your own needs. Do a global search for “XXXX” ← I did not swear here. That’s the placeholder I use in code so I can find things I need to work on :slight_smile:

This is a full working sample app. It will create a database file on your desktop with two tables, populate these tables, parse them into smaller chunks, embed into vector format, and allow you to chat with the data

Chatbot for forum.xojo_binary_project.zip (24.7 KB)

And if you’d like to see how I am using the chatbot (named Cara) in my desktop app, here is a short video on YouTube. The attached sample app just uses a TextArea for the chat, but in my final project, I made chat bubbles for the user experience
https://youtu.be/bvtoMYwLEH8

One final note. Creating something like this was well beyond my coding expertise. I got help from Chat in building this. “Machines making machines” ← bonus points for who said that and from what movie haha :grin:

7 Likes

Ryan, this example is absolutely amazing. One of the cleanest examples I’ve seen on this forum. Thank you so much!
I’d love to figure out how to upload PDF files instead of plain text, but for those playing at home, please have a look at this example!

3 Likes

Congrats, yes, it looks great seeing the video.

When I click on the YouTube video link, it says “Your access to this site has been limited by the site owner” — not public, I guess?

What do you mean by “Step 1: Add your content to the content table. Your content will need a title and the full text”?

What is content? A topic of research? Thank you.

Video works fine here in Australia. I’m not logged in to YouTube or anything special.

Meet Cara: Your RD Exam Chatbot Powered by Study Suite® - Visual Veggies

Ah sorry. I misread. I thought you meant the video didn’t work. I haven’t tried going to the live site, just used the code linked in the post.

And to answer your question, the content could be any text, i.e. a whole textbook, for example.

I think my web team for my website blocks users from outside the US, or at least some of the more hacking-fanatic countries. The link is just an info page (announcement) about the chatbot

The content can be anything you want. In my example, and in my full project, every piece of content is linked to one of my articles. Say one of my articles is “nutrition basics”. That’s the title of my article. I have a wealth of text talking about this subject that would go into the full-text spot. When the article is parsed (say there are 3 chunks), the chunk IDs need to be unique, so it’ll be like:

  • nutrition basics#0
  • nutrition basics#1
  • nutrition basics#2

You can also think of the title as a chapter. Chapter 1 would be a title with all the full text of ch 1. Chapter 2 would be the next title and so on
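The chunk-ID scheme described above is trivial to sketch (Python, a hypothetical helper rather than the project’s actual Xojo code):

```python
def chunk_ids(title, chunks):
    """Give each chunk of an article a unique ID of the form 'title#index'."""
    return [f"{title}#{i}" for i in range(len(chunks))]

print(chunk_ids("nutrition basics", ["chunk a", "chunk b", "chunk c"]))
# → ['nutrition basics#0', 'nutrition basics#1', 'nutrition basics#2']
```

The title-plus-index convention also lets you trace any retrieved chunk back to its source article, which is handy for debugging why the bot gave a particular answer.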

Make it work your own way. You may not want titles. You may not want to pull data from tables. This is the way I needed mine to work, and it should hopefully act as a starting point for anyone who wants to make a bot sourcing only their own content

2 Likes

Hi Ryan, I’m poking around and am having some issues with context, i.e. I inputted a bunch of data the first time I ran the program and it worked well. I’ve since gone back and tried to add more data, and it gets added to my local database, but the chat doesn’t seem to know about it. How do I define what the model has access to? I.e., do I need to upload the database at each run, or is it stored persistently by OpenAI? I can’t get it to work either way but would love your thoughts. Has the change to GPT-5 affected this demo, Ryan?

Hi James. After you inputted your second round of data, did you go in and Parse the data? This is step 2 of the process. If you take a look at your local db table, the last column in the tblEmbed table, the Embedding column, should be populated with data, and NeedsEmbeding should be set to zero (meaning it has parsed the data into a format suitable for Chat)

You don’t need to do the Parse step after every article is added. It will store this data in the tblContent table (a temporary table that needs to be parsed). I will add a bunch of content at a time, and then when I am ready to test it, I will then do step 2 to parse the materials I recently added
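The “which rows still need parsing?” check described above is easy to reproduce. Here’s a Python/SQLite sketch — the table and column names mirror the ones mentioned in this thread (tblEmbed, Embedding, NeedsEmbeding), but the schema itself is an assumption, not the sample project’s exact one:

```python
import sqlite3

# In-memory stand-in for the sample project's local database.
db = sqlite3.connect(":memory:")
db.execute("""CREATE TABLE tblEmbed (
    ChunkID TEXT PRIMARY KEY,
    ChunkText TEXT,
    Embedding BLOB,
    NeedsEmbeding INTEGER DEFAULT 1)""")

# One chunk added after the last Parse (still flagged), one already embedded.
db.execute("INSERT INTO tblEmbed (ChunkID, ChunkText) "
           "VALUES ('nutrition basics#0', 'some article text')")
db.execute("INSERT INTO tblEmbed (ChunkID, ChunkText, Embedding, NeedsEmbeding) "
           "VALUES ('nutrition basics#1', 'more text', x'00', 0)")

# Rows still waiting for step 2 (the Parse/embed pass):
pending = db.execute(
    "SELECT ChunkID FROM tblEmbed WHERE NeedsEmbeding = 1").fetchall()
print(pending)  # → [('nutrition basics#0',)]
```

If newly added articles never show up in chat answers, a query like this against your local db is a quick way to confirm whether they were actually embedded or are still sitting in the queue.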

Nothing here is being saved to OpenAI’s server. It is all housed locally on the machine

I haven’t tested yet with GPT-5, but I don’t see why it wouldn’t work. You would need to change the model in the winChat window, in thrSendRequest, if desired. It defaults in my sample project to gpt-4

Hi Ryan.

Thanks for the reply. Yes, I can confirm the database is as above. Really strange, as it’s only aware of the first articles. Now I’m starting to wonder if there are any special characters etc. in the data that are blocking it somehow.