For the longest time, I’ve been trying to figure out a way to “survive” in this new AI age without having to fork over a ton of money just to keep up. I’ve tried using local models via Ollama, and while they definitely work to a degree, they’re (unsurprisingly) not as good as the big model providers.

The local models tend to

  • Forget what they’re doing
  • Struggle to break larger tasks into smaller ones
  • Lose focus easily
  • Have weaker coding performance
  • Drift over longer sessions

So to improve the reliability of fully local, smaller models (and to keep all my data local and in my own network), I created Loki.

It’s a local-first, batteries-included command line tool and runtime for building and running LLM workflows locally. It’s model agnostic and supports things like

  • Agents and agent delegation
  • Roles/personas
  • MCP Servers
  • RAG
  • Custom tools
  • Macros
  • Workflow Scripting

A lot of the features it supports are specifically designed to compensate for weaknesses in smaller local models. For example:

  • Auto continuation to keep pushing models to completion instead of stopping halfway through problems
  • Parallel agent delegation so tasks can be split into smaller, focused scopes
  • Workflow-based execution (“If this, do that”) for building more reliable and repeatable automations

It also supports the major cloud providers if you want them (which definitely helped while testing 😄), but my long-term goal is simple:

Get as close as possible to Claude Code-style reliability using fully local models.

I’m always open to feedback, questions, or ideas.

Repo: https://github.com/Dark-Alex-17/loki

  • naught@sh.itjust.works
    link
    fedilink
    English
    arrow-up
    7
    ·
    21 hours ago

    Opencode isn’t very fun to set up with local LLMs and I’ve had issues with tool calling, but it’s very doable! That said, OpenCode is my go-to, absolutely love it compared to all alternatives I’ve tried

    • alehc@slrpnk.net
      link
      fedilink
      English
      arrow-up
      2
      ·
      12 hours ago

      Haven’t configured much beyond this but what’s wrong with ollama launch opencode <model>? Haven’t had an issue yet.

      • naught@sh.itjust.works
        link
        fedilink
        English
        arrow-up
        1
        ·
        7 hours ago

        yo… what? i was configuring json files n shit with custom sources. granted i used lmstudio as ollama doesn’t support mlx models or something for mac. This is definitely the easy way if you’re using ollama.

      • naught@sh.itjust.works
        link
        fedilink
        English
        arrow-up
        1
        ·
        7 hours ago

        Is that included in the token & cost counter? I haven’t really noticed that yet. It’s just the most reliable and best harness i’ve yet used. For context i’ve only otherwise tried claude, gemini, and aider. More if you count non cli apps

        • Evotech@lemmy.world
          link
          fedilink
          English
          arrow-up
          2
          ·
          edit-2
          6 hours ago

          Yes of course. It’s all tools and skills and system stuff

          But 10k isn’t much in the grand scope. But it can be a big hurdle if you want to use opencode with local small models

    • Dark-Alex-17@lemmy.worldOP
      link
      fedilink
      English
      arrow-up
      7
      ·
      20 hours ago

      When it comes to writing code, OpenCode is my go-to as well. It’s my ultimate benchmark for how well optimized and reliable I can make local models function in Loki.