Every AI feature I've worked on has done the same quiet thing: collect the user's text, send it to someone else's server, pay per token, and pray the network holds. That's fine until it isn't: Your user is on a flight, with no network and a dead feature. It's a journaling app, where "we send your private thoughts to a third party" is a hard no. Finance notices the OpenAI bill climbing in a straight line with usage. There's another option most React Native devs still treat as exotic: run the mode

Six Lines, Zero API Calls: Running LLMs On-Device in React Native
Vikrant Negi
