I really don't know; I had asked ChatGPT to create it, and earlier it gave me a wrong one, so I had to try a lot of things to see how it worked on my Mac.
I then pasted that whole convo into AI Studio (Gemini Flash) to summarize it and give you the correct settings, since my original settings included some servers and their IPs from the Zed remote feature.
Sorry that it didn't work. I again asked ChatGPT, starting from my working configuration, and here's what I got (this may also not work, so YMMV):
Honest answer: I tested it on GPT-2 (124M) and the results are mixed.
The mathematical claims hold up. I ran 58 tests covering ternary matmul correctness, memory compression, and numerical stability. The 16x compression works, the zero-multiplication property is verified, and the epistemic layer correctly abstains on high-entropy distributions.
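For anyone who wants the gist without reading the repo: here's a minimal sketch of what ternary quantization and the zero-multiplication property look like. This is my own illustrative NumPy version, not the code from github.com/Zaneham/Ternary_inference; the threshold scheme (`thresh` times the mean absolute weight) is an assumption. The 16x figure is the usual arithmetic: 2 bits per ternary weight versus 32-bit floats.

```python
import numpy as np

def ternarize(w, thresh=0.5):
    """Quantize float weights to {-1, 0, +1} plus a per-tensor scale.
    This is a generic absmean-style scheme, not necessarily the repo's."""
    scale = np.abs(w).mean()
    t = np.zeros_like(w, dtype=np.int8)
    t[w > thresh * scale] = 1
    t[w < -thresh * scale] = -1
    return t, scale

def ternary_matmul(x, t, scale):
    """Matmul with ternary weights: per output column, just add the inputs
    where the weight is +1 and subtract where it is -1. Zero weights are
    skipped entirely, so no multiplications by weight values occur."""
    out = np.zeros((x.shape[0], t.shape[1]))
    for j in range(t.shape[1]):
        pos = x[:, t[:, j] == 1].sum(axis=1)
        neg = x[:, t[:, j] == -1].sum(axis=1)
        out[:, j] = scale * (pos - neg)
    return out
```

The add/subtract loop gives the same result as an ordinary float matmul against the dequantized weights, which is the kind of correctness property those matmul tests would check.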
What does not work is post-training quantization. When I quantized GPT-2's weights to ternary and ran generation, the output was garbage. This is expected because the model was never trained with ternary constraints. BitNet gets coherent output because they train from scratch with ternary baked in. I did not do that.
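A toy illustration of why naive post-training quantization falls apart (again, my own sketch, not the repo's code, and random matrices standing in for real GPT-2 weights): each ternarized layer introduces some error, and stacking layers compounds it, which is roughly what garbles the generations.

```python
import numpy as np

rng = np.random.default_rng(0)

def ternarize(w):
    # Absmean-style ternarization: assumption, not the repo's exact scheme.
    scale = np.abs(w).mean()
    return np.sign(w) * (np.abs(w) > 0.5 * scale) * scale

# Toy "network": a stack of random linear layers with tanh, standing in
# for a model that was never trained under ternary constraints.
x = rng.standard_normal((1, 64))
x_q = x.copy()
for _ in range(12):  # roughly GPT-2 small's layer count
    w = rng.standard_normal((64, 64)) / np.sqrt(64)
    x = np.tanh(x @ w)               # full-precision forward pass
    x_q = np.tanh(x_q @ ternarize(w))  # same pass with ternarized weights

rel_err = np.linalg.norm(x - x_q) / np.linalg.norm(x)
print(f"relative error after 12 layers: {rel_err:.2f}")
```

A ternary-aware training loop (as in BitNet) learns weights that survive this rounding; quantizing after the fact does not.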
The actual novelty here is not the quantization itself but the epistemic output layer, which treats the ternary zero as "I do not know" rather than as mere sparsity. My tests show it correctly abstains on future predictions and impossible knowledge while answering factual queries confidently. But I should be clear: those tests use hand-designed probability distributions, not outputs from a trained model.
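The abstention rule is easy to sketch: gate on the entropy of the output distribution. This is my reconstruction of the idea, not the repo's exact implementation, and `abstain_frac` is a hypothetical knob.

```python
import numpy as np

def entropy(p):
    """Shannon entropy in nats, ignoring zero-probability entries."""
    p = p[p > 0]
    return -(p * np.log(p)).sum()

def epistemic_output(logits, abstain_frac=0.8):
    """Return an answer index, or None ('I do not know', the ternary zero)
    when entropy exceeds abstain_frac of the maximum possible entropy."""
    p = np.exp(logits - logits.max())  # stable softmax
    p /= p.sum()
    if entropy(p) > abstain_frac * np.log(len(p)):
        return None  # abstain
    return int(p.argmax())

confident = np.array([10.0, 0.0, 0.0, 0.0])  # peaked: answers
uniform = np.zeros(4)                        # max entropy: abstains
```

A peaked distribution ("factual query") gets an answer; a near-uniform one ("future prediction") gets the explicit abstention.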
I do not have the compute to train a ternary model from scratch, so coherent generation remains theoretical. The code is at github.com/Zaneham/Ternary_inference if you want to poke at it. Happy to be proven wrong on any of this. tl;dr: yes, the math works, but current models aren't made for it. The most interesting thing is that the LLM can say when it doesn't know.
Also, this has not been published in a peer-reviewed journal. Not everything published in a peer-reviewed journal is true, but it's a minimal filter.