More

andhuman · 2026-02-21T08:28:34 1771662514

What helped me get a better understanding of oauth and oidc, was this YouTube video [0].

andhuman · 2026-02-16T16:42:34 1771260154

Same setup I’m using! Parakeet and pocket turbo. It’s feels good enough for daily usage.

andhuman · 2026-02-11T11:37:44 1770809864

Have you tired using the run action instead to clean the data? Win+r

andhuman · 2026-02-07T09:00:52 1770454852

I have this nagging feeling I’m more and more skimming text, not just what the LLMs output, but all type of texts. I’m afraid people will get too lazy to read, when the LLM is almost always right. Maybe it’s a silly thought. I hope!

gspr · 2026-02-07T09:48:19 1770457699

This is my fear too.

People will say "oh, it's the same as when the printing press came, people were afraid we'd get lazy from not copying text by hand", or any of a myriad of other innovations that made our lives easier. I think this time it's different though, because we're talking about offloading the very essence of humanity – thinking. Sure, getting too lazy to walk after cars became widespread was detrimental to our health, but if we get too lazy to think, what are we?

agumonkey · 2026-02-07T10:06:17 1770458777

there are some youtube videos about the topic, be it pupil in high school addicted to llms, or adults losing skills, and not dev only, society is starting to see strange effects

atentaten · 2026-02-07T12:40:12 1770468012

Can you provide links to these videos?

agumonkey · 2026-02-07T17:19:53 1770484793

This one is in french (hope you don't mind), https://youtu.be/4xq6bVbS-Pw?t=534 mentions the issues for students and other cognitive issues.

andhuman · 2026-01-30T13:28:37 1769779717

One of the more annoying software that does this is the copilot Office 365 on the web. Every time (!) I open it, it shows a popup on how to add files to the context. That itself would be annoying, but it also steals focus! So you would be typing something and suddenly you’re not typing anymore for M$ decided it’s time for a popup. I finally learned to just wait for the pop up and then dismiss it with esc. Ugh!

noAnswer · 2026-01-30T17:03:28 1769792608

If you login to the exchange online admin center you first have to complete a short "on-rails-shooter" video game. They constantly shuffle shit around and want to give you a tour via popups about it.

I have the admin accounts for multiple companies, so I have to play the game repeatedly.

pixl97 · 2026-01-30T15:36:07 1769787367

MS has never learned to not interrupt the user. This has been a dark pattern for them since their very first window manager.

andhuman · 2026-01-24T07:52:20 1769241140

I built this recently. I used nvidia parakeet as STT, open wake word as the wake word detection, mistral ministral 14b as LLM and pocket tts for tts. Fits snugly in my 16 gb VRAM. Pocket is small and fast and has good enough voice cloning. I first used the chatterbox turbo model, which perform better and even supported some simple paralinguistic word like (chuckle) that made it more fun, but it was just a bit too big for my rig.

PhilippGille · 2026-01-24T08:12:20 1769242340

OP asked:

> Is anyone doing true end-to-end speech models locally (streaming audio out), or is the SOTA still “streaming ASR + LLM + streaming TTS” glued together?

Your setup is the latter, not the former.

andhuman · 2026-01-22T19:19:49 1769109589

It’s uncanny good. I prefer it to pocket, but then again pocket is much smaller and for realtime streaming.

indigodaddy · 2026-01-23T03:47:47 1769140067

Ah right I guess I meant for instant which I assume qwen can't do

andhuman · 2026-01-19T20:21:12 1768854072

Gave it four of my vibe questions around general knowledge and it didn’t do great. Maybe expected with a model as small as this one. Once support in llama.cpp is out I will take it for a spin.

andhuman · 2026-01-16T07:24:43 1768548283

I’ve tried the voice clinking and it works great. I added a 9s clip and it captured the speaker pretty well.

But don’t do the fake mistake I did and use a hf token that doesn’t have access to read from repos! The error message said that I had to request access to the repo, but I’ve had already done that, so I couldn’t figure out what was wrong. Turns out my HF token only had access to inference.

andhuman · 2026-01-10T08:08:23 1768032503

I’ve recently bought the LG with 4th generation OLED, and for me that works for long coding sessions (I use it for work). They shifted or did something with the pixel arrangement for this generation just for text legibility.