A Lemmy nomad. Wish there was a way to migrate posts and comments from .world to .ml to here… 😪

  • 3 Posts
  • 69 Comments
Joined 2 months ago
Cake day: March 14th, 2025


  • will@lemm.ee to LocalLLaMA@sh.itjust.works · Specialize LLM · English · 1 upvote · 2 days ago

    Making your own embeddings is for RAG. Most base model providers have standardized on OpenAI's embeddings scheme, but there are many ways to do it. Typically you embed a few tokens' worth of data at a time and store that in your vector database. This lets your AI later do some vector math (usually a cosine similarity search) to see how similar (related) the stored embeddings are to each other and to what you asked about. There are fine-tuning schemes where you make embeddings before the tuning as well, but most people today use whatever fine-tuning service their base model provider offers, which usually has some layers of abstraction.
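
To make the "vector math" part concrete, here is a rough sketch of a cosine similarity search in plain Python. The three-number vectors and the `store` list are made up for illustration (real embeddings come from an embedding model and have hundreds or thousands of dimensions), and `top_k` is a hypothetical helper, not any vector database's API.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors (1.0 = same direction)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # a zero vector has no direction, so treat it as unrelated
    return dot / (norm_a * norm_b)


def top_k(query_vec, store, k=2):
    """Return the k stored (score, text) pairs most similar to the query vector."""
    scored = [(cosine_similarity(query_vec, vec), text) for text, vec in store]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:k]


# Hypothetical toy "vector database": (text, embedding) pairs.
store = [
    ("feeding a sourdough starter", [0.9, 0.1, 0.0]),
    ("frying chickpeas", [0.7, 0.3, 0.1]),
    ("strong nuclear force", [0.0, 0.2, 0.9]),
]

query = [0.8, 0.2, 0.0]  # pretend this is the embedding of the user's question
for score, text in top_k(query, store, k=2):
    print(f"{score:.3f}  {text}")
```

A real vector database does the same comparison, just with approximate indexes so it doesn't have to score every stored vector.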


  • I don’t know about micro but I keep a conventional amount of starter in the fridge and have had it for 5+ years. If I’m out of a bread phase I take it out once every few months, let it come to temperature and feed it. When it gets bubbly and happy again I give it more flour and water till it’s thick and stick it back in the fridge. When my next bread phase kicks in I leave it out of the fridge for a day, feed it again and then use it like normal (once I see it can double in size). Very little waste this way and super-low effort.

    I’ve also dehydrated strips of extra-thick starter and have successfully reanimated them years later (just did it recently with 4+ year dehydrated starter in fact).


  • will@lemm.ee to LocalLLaMA@sh.itjust.works · Specialize LLM · English · 7 upvotes · 3 days ago

    The easiest option for a layperson is retrieval-augmented generation, or RAG. Basically you encode your books and upload them into a special kind of database (a vector database), and then tell a regular base model LLM to check that data when making an answer. I know ChatGPT has a built-in UI for this (and maybe Anthropic too), but you can also build something out using LangChain or Open WebUI and the model of your choice.

    The next step up from there is fine-tuning, where you kinda retrain a base model on your books. This is more complex and time-consuming but can give more nuanced answers. It's often done in combination with RAG for particularly large bodies of information.
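
For the RAG part, the flow described above (encode the books, store them, look up the relevant bits, hand them to the model) could be sketched roughly like this. Everything here is an assumption for illustration: `toy_embed` is a word-count stand-in for a real embedding model, the list returned by `build_index` stands in for a vector database, and the final prompt would be sent to whatever LLM you picked. It is not how LangChain or Open WebUI implement it.

```python
import math
import re
from collections import Counter


def chunk_text(text, max_words=40):
    """Split text into chunks of at most max_words words."""
    words = text.split()
    return [" ".join(words[i:i + max_words]) for i in range(0, len(words), max_words)]


def toy_embed(text):
    """Stand-in for an embedding model: a sparse word-count vector."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def build_index(documents, max_words=40):
    """'Upload' step: chunk every document and store (chunk, embedding) pairs."""
    return [
        (chunk, toy_embed(chunk))
        for doc in documents
        for chunk in chunk_text(doc, max_words)
    ]


def retrieve(question, index, k=2):
    """Return the k chunks whose embeddings are closest to the question's."""
    q = toy_embed(question)
    ranked = sorted(index, key=lambda item: cosine(q, item[1]), reverse=True)
    return [chunk for chunk, _ in ranked[:k]]


def build_prompt(question, index, k=2):
    """Assemble the text that would be sent to the LLM of your choice."""
    context = "\n---\n".join(retrieve(question, index, k))
    return (
        "Answer using only the context below.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {question}"
    )


# Hypothetical "books", kept tiny for the example.
books = [
    "Sourdough starter can be kept in the fridge and fed every few months.",
    "Chickpeas fry in about fifteen minutes in a dry pan with a little oil.",
]
index = build_index(books)
print(build_prompt("How long do chickpeas fry?", index, k=1))
```

Fine-tuning would change the model itself; this only changes what text the model gets to read before answering, which is why it is the easier place to start.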







  • will@lemm.ee to Science Memes@mander.xyz · 4 fundamental forces · English · 127 upvotes, 1 downvote · 4 days ago

    Shouldn’t gravity be like a tiny, vaguely dragon-shaped worm off in another field?

    I mean, messing with the strong force in a fistful of atoms gets you a nuclear bomb. Meanwhile, my old, achy self can jump up and resist a whole Earth's worth of gravitational force.


  • Fried chickpeas. I use cans since they're more convenient; dried beans are even cheaper and work fine too, but you have to soak them for 24h and then boil them first. Either way, they're seriously cheap, loaded with protein and fiber, and delicious:

    Rinse beans and dump into a large dry pan on high heat. Move them around until they have mostly dried up and just barely start sticking to the pan. Then add oil - just once or twice around the pan is plenty - and some salt. Then let them fry in that little bit of oil. Move them with a spoon every so often to keep from sticking too much.

    After about 15 min you have these golden-brown, crunchy, slightly salty little things. They're great, and go with everything as a side dish.


  • I don't think it's heavily defederated, but I did notice enough "missing" content from my .ml account that I decided to move to another instance a few months ago. Which is weird, because I never had an issue with other .ml users (since I'm not into politics) and wouldn't have guessed what was going on if it weren't for other posts like this.







  • ActivityPub and the fediverse were started specifically to deal with the kind of centralization that has led to the shit state of the Internet today, so I'd say the fact that you've made it here means you've kinda found what you're looking for already. Find and read/write WriteFreely/Ghost/Plume blogs (and shitposts) instead of Substack or Medium, use Lemmy for threaded conversations (and shitposts) instead of Reddit, and Mastodon for microblogs (and shitposts) instead of Twitter. PeerTube is not a drop-in replacement for YouTube, but 90% of the new content on YouTube is garbage today anyway, and there's nothing stopping you from browsing older videos (with FreeTube or similar to block a good portion of YouTube's enshittified UI).

    Plus if you do stick to these off-the-beaten-path alternatives, it’s still a fun time to be a content creator since you’re not focused on maximizing engagement or monetization – which is the true source of the godawful state of the Internet today.