A recipe manager for our family that strips all the SEO text out using the OpenAI API. I built this after someone in our family got diagnosed gluten intolerant and we had to make changes to our usual recipes.
Normal recipe sites tend to be full of irrelevant (SEO optimized) text, ads and tracking, and I wanted something to just get the recipe in a clean form.
It’s a basic web application (mostly in Go) to manage recipes. New recipes are imported from an URL, after which it extracts the plain text from the site and uses GPT to get a markdown formatted recipe and list of ingredients.
This would’ve been much harder pre-GPT, but now was trivial to implement.
> This would’ve been much harder pre-GPT, but now was trivial to implement.
I wonder how close looking for a group of lines that start with a number, then taking all the text following that group of lines would get you. I bet that would get you pretty close to the desired effect most of the time.
I also wonder if more standard NLP methods might work here, rather than using the full power of an LLM. Instructions are grammatically constructed as commands, so if you start with doing what I mentioned in the previous paragraph, then parse each sentence following the group of lines starting with numbers, you should be able to determine which ones are commands. If a paragraph contains no commands, it's obviously not directions.
Just some random thoughts. I do see what you mean, though: definitely not trivial sans assistance from an LLM.
On the downside, you'll need to purchase each version separately. I bought it on Apple and then moved to Windows. I'm considering buying it again but I'm curious how well Obsidian would work for it considering it's free and has the rest of my life.
Normal recipe sites tend to be full of irrelevant (SEO optimized) text, ads and tracking, and I wanted something to just get the recipe in a clean form.
It’s a basic web application (mostly in Go) to manage recipes. New recipes are imported from an URL, after which it extracts the plain text from the site and uses GPT to get a markdown formatted recipe and list of ingredients.
This would’ve been much harder pre-GPT, but now was trivial to implement.