LLM Inference On-Device in React Native: The Practical Aspects
A practical look at reliability, performance, libraries, and tradeoffs when running LLM inference locally in React Native apps.
Artur Morys-Magiera explored what it actually means to run LLM inference directly on mobile devices in React Native applications. Instead of treating “AI” as a buzzword, he narrowed the focus to LLMs and examined why teams might move inference on-device: reliability without network dependency, stronger privacy guarantees, and lower latency without cloud queues. He also highlighted real user-facing constraints, including model size, disk usage, and hardware variability across iOS and Android devices.
From there, the talk moved into the practical engineering layer: hardware acceleration (GPU, NPU, CPU), runtime fragmentation, debugging challenges with abstraction layers like OpenCL, and real-world performance issues traced to memory layout differences. Artur compared available libraries, explained their tradeoffs, and showed how a unified API approach can simplify integration while still supporting optimizations such as quantization, compilation-time improvements, and model selection based on device capability.
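The model-selection-by-device-capability idea can be sketched as a small pure function. This is an illustrative example, not the API of any specific library discussed in the talk; the model names, RAM thresholds, and disk sizes are made up for the sake of the sketch:

```typescript
// Hypothetical model variants; names, RAM thresholds, and sizes are illustrative.
type ModelVariant = {
  name: string;
  minRamGB: number;   // minimum device RAM to run the variant comfortably
  diskSizeMB: number; // download/disk footprint surfaced to the user
};

// Ordered from most to least capable, so the first match is the best fit.
const VARIANTS: ModelVariant[] = [
  { name: "llama-3b-q4", minRamGB: 6, diskSizeMB: 1900 },
  { name: "llama-1b-q4", minRamGB: 4, diskSizeMB: 700 },
  { name: "llama-1b-q2", minRamGB: 2, diskSizeMB: 400 },
];

// Pick the largest variant the device can handle, or null if none fits.
function selectModel(deviceRamGB: number): ModelVariant | null {
  return VARIANTS.find((v) => deviceRamGB >= v.minRamGB) ?? null;
}
```

In a real app the RAM figure would come from a native module, and the threshold table from the library or a remote config, but the shape of the decision stays this simple: capability in, variant (or a graceful "not supported") out.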
What you’ll walk away with:
- When running LLMs locally improves reliability, privacy, and latency
- How model size and OS-provisioned models impact real user experience
- Why hardware acceleration and device fragmentation shape performance decisions
- The tradeoffs between TF Lite, ONNX, ExecuTorch, MLC, llama.cpp, and Apple-based solutions
- How quantization, compilation optimizations, and unified APIs reduce integration risk
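To see why quantization matters so much for the disk-usage and model-size points above, a back-of-envelope estimate helps (this ignores quantization-scale metadata and embedding-table details, so real files are somewhat larger):

```typescript
// Rough model size estimate: parameters * bits per weight, converted to MB.
// Ignores per-group scale metadata and format overhead, so it's a lower bound.
function approxModelSizeMB(params: number, bitsPerWeight: number): number {
  return (params * bitsPerWeight) / 8 / (1024 * 1024);
}

const oneBillion = 1_000_000_000; // a 1B-parameter model

console.log(approxModelSizeMB(oneBillion, 16).toFixed(0)); // fp16  → "1907"
console.log(approxModelSizeMB(oneBillion, 4).toFixed(0));  // 4-bit → "477"
```

Going from fp16 to 4-bit cuts the footprint roughly 4x, which is often the difference between a download users will accept and one they won't, and between a model that fits in a mid-range phone's RAM and one that doesn't.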
