Newsy.co

www.transcriptionists.com

Show HN: Desktop conversation practice tool for serious language learners

I’ve been studying Japanese for a few years pretty seriously (anki decks, textbooks, online tutor) but I always felt that the speaking practice pipeline could be optimized. With a real tutor, my mistakes didn’t get enough specific reps, they oftentimes weren’t pedantic enough, and practice in multiple contexts was a problem. (I also subconsciously avoid saying things I know I might mess up on to not sound dumb to a real human)I’m pretty bearish on most other apps, they seem mostly for casual lea

Show HN: Push-to-talk dictation for Android apps and terminal workflows

I built this because MacWhisper is not available on Android and voice typing on Android is pretty bad. Moreover Gemini does not allow you to edit transcripts before they are auto-sent.I like my SwiftKey keyboard though, so I did not want to replace that. So the only way was to make a floating push-to-talk button on top of any app.You tap the overlay, speak, tap again, transcribe, and insert text into the currently focused field.It supports local on-device transcription, cloud transcription with

Show HN: EdgeWhisper – On-device voice-to-text for macOS (Voxtral 4B via MLX)

I built a macOS voice dictation app where zero bytes of audio ever leave your machine.EdgeWhisper runs Voxtral Mini 4B Realtime (Mistral AI, Apache 2.0) locally on Apple Silicon via the MLX framework. Hold a key, speak, release — text appears at your cursor in whatever app has focus.Architecture: - Native Swift (SwiftUI + AppKit). No Electron. - Voxtral 4B inference via MLX on the Neural Engine. ~3GB model, runs in ~2GB RAM on M1+. - Dual text injection: AXUIElement (preserves undo stack) with N

Show HN: TypeWhisper – speech-to-text with multiple engines, profiles

Hey HN, I'm Marco, the creator of TypeWhisper.TypeWhisper is a free, open-source speech-to-text app for macOS and Windows. Everything runs locally on your machine - no cloud, no telemetry, no data collection. Your voice never leaves your device.What makes it different:- Multiple engines, your choice: On macOS: WhisperKit, Parakeet TDT, Apple SpeechAnalyzer. On Windows: Parakeet TDT, Canary 180M Flash. All run on CPU, no GPU needed. You can also plug in cloud APIs (OpenAI, Groq, Deepgram) if

Show HN: Yak – Voice typing tool in Tauri/Rust that auto-presses Enter for you

Hi HN,I built Yak (https://getyak.app), a voice typing tool that converts speech into ready-to-use text. I didn't choose a traditional STT-then-edit pipeline, but a multimodal model that transcribes, polishes, and formats simultaneously, which gives it many interesting features:AI Command:Select text in any app, press the hotkey(default to fn), speak an instruction — "translate to Japanese", "make it shorter". Yak replaces the selection in place.Auto-send:Press

Show HN: TalkBlog – Speak Your Mind. Publish Your Words

Hi everyone.TalkBlog is an app that lets you record audio snippets, edit them in an interactive workspace, render them to HTML with AI transcription, and instantly publish your new blog post to the internet. Or download the HTML and use it how you'd like.The goal is to make it effortless to share the ideas in your head as a blog post.You may have an idea worth sharing, but no time (or motivation) to sit down and type it out.TalkBlog makes it easier to share your authentic ideas with the wor

Show HN: Dumped Wix, my AEC consultancy's storefront is now an AI Edge

I run a building design consultancy for homeowners and architects, not a SaaS firm. Honestly, I'm not going to claim we were trying to build some fantastic ‘anti-fragile alternative’ for the future… I just got tired of paying Wix $40 a month for a brochure no one read. The portfolio was static, the inquiries were generic, and every time a potential client asked about setback variances, I'd lose hours explaining the same thing. So last December, I told my wife I was killing the website.

Show HN: Snippets – AI-first legacy app: Record messages, deliver years later

I grew up in Singapore, spent a decade in the US, now live in Canada. Family spread across multiple countries - I'm sure it's a typical story for several (most?) of the folks here.My dad passed suddenly in 2024, when he was traveling with my mom. This was my biggest nightmare growing up, when I first moved half the world away from my family in my teens...the one thing that jolts you awake at night ("...what if something happens to my parents and I'm not there?"), and it

Show HN: CastLoom Pro – Turn podcasts into a personal knowledge base

Hi HN,I’m Ethan, an indie developer.I listen to a lot of podcasts while coding or commuting, and I often want to save interesting insights from episodes. I tried tools like MacWhisper for transcription, but it only works on macOS and the workflow didn’t quite fit what I wanted.So I built CastLoom Pro.It’s a desktop app that lets you search, play, download, transcribe, translate, and archive podcasts in one place. The idea is to turn podcasts into something searchable and reusable instead of just

Show HN: Rhesis AI - Multimodal test cases for agentic evals

Hey HN, Nicolai here, co-founder of Rhesis AI.Most eval frameworks were designed when LLM inputs were text strings. That assumption breaks fast once your AI agent handles boarding passes, invoices, audio recordings or support screenshots. Text-only test cases become workarounds. So we added multimodal support to Rhesis: attach a file to a test case, run it, evaluate the response. Simple on the surface. Two non-obvious problems underneath.Normalizing file delivery across endpoints: Rhesis sends t

Show HN: Earleaf – An audiobook player that syncs with your physical book

Hi HN! After 15 years as an iPhone user, I recently switched to Android. What I missed the most after switching was my old audiobook player, and I couldn't find one I liked, so I decided to build one.It's called Earleaf, and lets you play your local audibook files.The feature I'm most excited about is Page Sync. You take a photo of a page in your physical book (or e-book) and the app finds that position in the audiobook and jumps to it. It works by transcribing the book on-device,

Nvidia CEO heralds ‘inference inflection’ as next phase of AI boom, backed by $1 trillion in orders

Huang predicted that Nvidia will be grappling with a $1 trillion backlog in orders for its chips by the end of the year, doubling his estimate from a year ago.

BCG consultant behind 'AI brain fry' study says she's 'pessimistic' humans can overcome it anytime soon

While AI tools increase productivity, they may also cause 'brain fry,' a mental fatigue affecting workers who use the tech a lot.

NVIDIA claims DLSS 5 will deliver 'photoreal' image quality with AI this fall

Just months after announcing DLSS 4.5 at CES, NVIDIA has unveiled its next major upscaling technology, DLSS 5. The company is ...

Memories.ai is building the visual memory layer for wearables and robotics

Memories.ai is building a large visual memory model that can index and retrieve video-recorded memories for physical AI.

Prediction: This Artificial Intelligence (AI) Stock Will Skyrocket After March 18 (Hint: It's Not Micron)

This contract electronics manufacturer has been a solid investment in the past year, and its upcoming quarterly report could ...

Is this product 'human-made'? The race to establish an AI-free logo

The backlash to the growing use of the tech has led to an explosion in attempts to come up with 'AI-Free' logo that could be used globally.

The world’s most valuable company just sent another signal that AI agents are going to be everywhere

Tech giant Nvidia, the world’s most valuable company and the poster child of the AI boom, is banking its future on the rise of AI agents.

Skild AI, Nvidia deploy robot brain on Blackwell assembly lines

March 16 (Reuters) - Skild AI's artificial intelligence model will power robots manning Foxconn's assembly lines in Houston, ...

Deepak Jain to Host Two Sessions at Nvidia GTC 2026

Deepak Jain, Founder of AiNET Factory and a developer of large-scale AI infrastructure platforms, to Host Two Sessions at NVIDIA GTC 2026. The sessions focus on the economics and deployments of large-scale AI Factories. San Jose, CA — March 12, 2026 — Deepak Jain, founder of AiNET Factory, will host two sessions at NVIDIA GTC 2026, the global AI conference organized by NVIDIA. At GTC 2026, Jain will lead conversations focused on one of the most pressing challenges facing artificial intelligence