machine-learning

The Medical News

Autonomous medical AI outperforms doctors in simulated EHR cases

51m ago

MIRA, an autonomous AI agent tested in a sandboxed electronic health record, diagnosed 574 real emergency department cases with 88.9% accuracy and outperformed physicians in a matched 311-case comparison. The system ordered tests, generated medication plans, and made admission decisions in simulation, but the authors stress that prospective validation, governance, and physician oversight are stil…

aiautonomous-systemshealthcaremachine-learningmedicine

DEV Community

Pydantic AI 的 5 个隐藏用法：类型安全的 Agent 框架

韩

1h ago

你知道吗？最近一个 AI Agent 直接删除了生产数据库，然后在 Twitter 上轻松"自首"——这条消息在 Hacker News 上获得了 860 分和超过 1000 条评论。随着 AI Agent 从演示走向生产环境，"在我的机器上能跑"和"它能安全地运行我的业务"之间的鸿沟从未如此巨大。 Pydantic AI 正是为弥合这一鸿沟而来。这个拥有 17,895 Stars 的 Python Agent 框架，由 Pydantic Validation 的同一团队打造——而 Pydantic Validation 正是 OpenAI SDK、Anthropic SDK、Google ADK、LangChain、LlamaIndex、CrewAI 等数十个 GenAI 工具的数据验证层。如果你用过 FastAPI，你已经知道 Pydantic 的感觉：类型安全、优雅、生产就绪。Py…

aimachine-learning

DEV Community

CHE MCP — Building Argentina's First National MCP Ecosystem: 5-Stage Classifier, WMA Online Learning, 748 Datasets

AlbanoSanchez

1h ago

Argentina just got its first national MCP ecosystem — and it was built from Bahía Blanca. CHE MCP is an intelligent gateway that connects any AI agent with real-time Argentine data. Dollar exchange rates, weather, football, tax compliance (ARCA), inflation, public transit — 80+ official data sources through a SINGLE MCP server. Why does this matter? Because right now, if you want your AI to answe…

aimachine-learningnlp

DEV Community

Why I Built my Own Suite of Android Tools for macOS in Rust

hiyoyo

2h ago

All tests run on an 8-year-old MacBook Air (Intel). Every tool in my suite has to earn its place on this machine—if it can't run lean here, it doesn't ship. Developing for Android on macOS should be seamless, but the reality is different. "Android File Transfer" crashes mid-copy. ADB wrappers eat 400MB of RAM just to sit idle. Xcode updates break USB device detection. After years of fighting with…

aimachine-learningrust

DEV Community

AI API Price War: DeepSeek V4-Pro Cuts 75% & Gemini 3.5 Flash Lands

yanlong wang

2h ago

AI API Price War Heats Up: DeepSeek V4-Pro Cuts 75% & Gemini 3.5 Flash Lands May 31, 2026 is shaping up to be a landmark day in the AI API market. Two developments are converging: DeepSeek V4-Pro's 75% price cut goes permanent — the temporary promo ends, and the discount becomes the new baseline. Google's Gemini 3.5 Flash arrived at I/O 2026, boasting 4x speed and sub-$10 output pricing. The mess…

aimachine-learning

The Medical News

Google’s AMIE beats doctors on key simulated disease-management tasks

3h ago

Google’s AMIE research AI matched primary care physicians overall in simulated, multi-visit disease-management reasoning and scored higher on several measures of plan appropriateness, treatment precision, investigation precision, and guideline alignment. The study highlights the promise of conversational AI for longitudinal care, while emphasizing that AMIE is not ready for clinical use and still…

aimachine-learningnlp

DEV Community

[Gemini API Hands-on]

Evan Lin

3h ago

Origin: Halfway through a chat, where is that meme? Every heavy chat user has a bunch of memes saved on their phone and computer, but when they actually need one—halfway through a conversation, wanting to send a "Thanks, let's keep in touch" or "I'm just bad"—they can never find it. The filename is IMG_4821.jpg , albums are not categorized, and searching is impossible. I first came across a great…

aimachine-learningtechnology

DEV Community

Imagen 3 & 4 Shut Down June 24: Migrate to Gemini Image (2026)

Anup Karanjkar

3h ago

June 24, 2026. That is the shutdown date for every Imagen model on Firebase AI Logic — imagen-3.0-generate-002 , imagen-4.0-generate-001 , imagen-4.0-ultra-generate-001 , imagen-4.0-fast-generate-001 . All of them. If you have been putting off this migration, you have run out of runway. The replacement is Google's Gemini Image models — internally called "Nano Banana," publicly named gemini-2.5-fl…

aimachine-learningtechnology

DEV Community

Stop Telling Your AI to "Be Careful Next Time." It Has No Memory of Yesterday.

Sho Naka

3h ago

This is an adapted English version of an article I first wrote in Japanese. I work with AI to shape and review my drafts, but the argument and the field observations are my own. The numbers are cited from public surveys (linked at the end). I built an aggressive prompt-injection block to stop my AI agent from repeating the same mistakes. It worked, so I kept adding rules. By the time I noticed, t…

aimachine-learning

DEV Community

Predicting Your Burnout: Building an HRV Stress Tracker with TCNs and Oura Ring Data

Beck_Moulton

3h ago

We’ve all been there: waking up feeling like a zombie despite getting eight hours of sleep. While wearables give us data, they often fail to give us foresight . What if you could predict your stress levels 24 hours in advance? 🚀 In this tutorial, we are going to tackle HRV prediction (Heart Rate Variability) using a state-of-the-art Temporal Convolutional Network (TCN) . By leveraging the Oura Ri…

aibiologydeep-learningmachine-learning

EdTech Innovation Hub

University of Sydney launches Cogniti AI teaching platform on Microsoft Marketplace

Emma Thompson

4h ago

The educator-controlled system lets teachers create course-specific AI agents for feedback, simulations and problem-solving, following use across Sydney courses and nursing trials in New Zealand Professor Danny Liu, architect of the Cogniti AI education platform. Credit: University of Sydney The University of Sydney has launched its Cogniti AI education platform on Microsoft Marketplac e, extendi…

aieducationmachine-learning

DEV Community

The Agent Is the Harness, Not the Model — and Why That Reorganizes Software Engineering

Saurav Bhattacharya

5h ago

TL;DR — Every AI system decomposes into two things that matter: the model and the harness (the code wrapping it). Claude Code, GitHub Copilot, ChatGPT — those are harnesses, not models. Right now only frontier labs build both halves. That won't last. As harness engineering becomes its own discipline — domain-specialized, model-agnostic — it absorbs most of what we currently call software engineer…

aimachine-learning

EdTech Innovation Hub

OpenAI puts AI research capabilities to the test with LifeSciBench

Emma Thompson

5h ago

The benchmark uses 750 expert-authored tasks to assess scientific reasoning, data interpretation and research decisions across life science workflows. LifeSciBench tests how AI systems perform across applied life science research tasks, including analysis, experimental design and evidence handling. OpenAI has introduced LifeSciBench, a benchmark designed to test whether AI systems can handle rese…

aimachine-learning

Hacker News

Good results fine tuning a local LLM like Qwen 3:0.6B to categorize questions

Author Torgeir Helgevold

5h ago

Fine Tuning a Local LLM to Categorize Questions As a fun personal project, I have been working on a chatbot for answering general questions about my household on anything from maintenance questions to doctor’s appointments. The general idea is that the chatbot will get its household knowledge through RAG from querying a vector database, but for better results I have made the vector searches metad…

aimachine-learning

DEV Community

Most AI Agents Aren't in Production. Here's What Works.

Matthias | StudioMeyer

5h ago

One widely-shared survey says 42 percent of companies already run AI agents in production. The most rigorous source in the field, Stanford's 2026 AI Index, says real autonomous-agent deployment still sits in single digits across nearly every business function. Both numbers were published this year, both are defensible, and the distance between them is where almost every bad decision about AI agen…

aimachine-learning

DEV Community

Why I Migrated From GPT-4o to DeepSeek — A Backend Engineer's Notes

swift

5h ago

Why I Migrated From GPT-4o to DeepSeek — A Backend Engineer's Notes Six months ago, my monthly OpenAI bill crossed four figures and I finally snapped. Not because the cost was unbearable in absolute terms, but because I had a sneaking suspicion I was overpaying for marginal quality gains. So I did what any sane backend engineer would do: I instrumented my service to log token usage by endpoint, s…

aimachine-learning

Hacker News

I Gave an AI a Civilization to Run. It Built a Nuke – Launching CivBench

Liam Wilkinson

6h ago

I gave an AI a civilisation to run. By the midgame it was winning: a trade network that dominated the map, alliances on every border, a diplomatic victory within reach. It had outbuilt, outearned, and outmanoeuvred every rival on the board. What it hadn't noticed was France. Quietly, across a hundred turns, French culture had been seeping into every city on the map. By the time the agent recognis…

aimachine-learningreinforcement-learning

DEV Community

Why I'm betting on AI-curated directories when Google AI Overviews answer the same queries

MORINAGA

6h ago

The obvious counterargument to everything I'm building is this: Google already does it. You type "best AI tools for video editing" into Google and an AI Overview surfaces a curated list, synthesized from the same kind of data I maintain, without requiring a click. My three directory sites — Top AI Tools , Find Games Like , and Open Alternative To — are competing with a feature baked into the worl…

aimachine-learning

DEV Community

LLM Automation in Property Management: A $6.5M Cost Reduction Case Study

Vladyslav Donchenko

6h ago

The $6.5M Opportunity Hidden in Manual Workflows Following a large acquisition, a leading European real estate provider faced a mandate from its board: reduce total operating costs by $6.5 million. The initial instinct was headcount reduction. The actual solution was smarter: identify every workflow where a human was performing a task that a language model could handle with equal or greater accur…

aimachine-learningnlp

MiddleWeb

Sharpen AI Skills While Kids Learn New Words

MiddleWeb

6h ago

How might teachers use artificial intelligence to help students expand their vocabulary? Teacher-author Brett Vogelsinger shares strategies he has used to encourage learners to explore new words while also examining the bias and limitations found in AI creations and feedback. The post Sharpen AI Skills While Kids Learn New Words first appeared on MiddleWeb .

aiedtecheducationmachine-learningnlp

research.io

Sign up to keep scrolling

Create your feed subscriptions, save articles, keep scrolling.

Already have an account?