Fuzzing Android Libraries with LibAFL QEMU Mode

In this post, we’ll build a LibAFL QEMU-mode fuzzer from scratch, walking through it block by block. We’ll also cover advanced "gotchas," such as getting AddressSanitizer (ASan) to play nicely with Android's Bionic libc inside QEMU.

Feb 23

LibAFL is a modern, high-performance fuzzing framework written in Rust. Instead of being a single “one size fits all” tool like AFL++ or libFuzzer, it’s built as a set of reusable components (Rust crates) that you can mix and match.

Because of that modular design, security researchers can assemble fuzzers that fit a specific target and workflow, whether that means snapshot-based fuzzing, custom coverage or crash feedback, or even combining fuzzing with symbolic execution, without needing to fork and heavily modify a huge C codebase.

https://github.com/AFLplusplus/LibAFL

To install the dependencies for LibAFL, follow the instructions at the URL below:
https://github.com/AFLplusplus/LibAFL?tab=readme-ov-file#building-and-installing

Extreme Customizability: You build your fuzzer out of blocks (fuzzers, schedulers, observers, feedbacks). If you need a custom mutation strategy, you write it in Rust and plug it in. https://docs.rs/libafl/0.15.3/libafl/#modules
Speed: Written in Rust with minimal overhead, so execution speeds are often competitive with, and can exceed, those of traditional C fuzzers.
Rust Safety: Fuzzer development is prone to bugs. Rust's memory safety guarantees eliminate a large class of common errors when building custom fuzzing harnesses.
Multi-Platform / Multi-Architecture: Out-of-the-box support for Windows, Linux, Android, and macOS, alongside powerful dynamic binary instrumentation (DBI) backends like Frida and QEMU.

LibAFL QEMU mode integrates the legendary QEMU emulation engine natively within LibAFL. Specifically, using QEMU user-mode emulation, it allows you to run binaries compiled for a different architecture (e.g., AArch64 Android) directly on your host machine.
The big win is speed and feedback: instead of launching a fresh QEMU process for every single test case (which is painfully slow), LibAFL keeps the emulator running and drives executions from within the fuzzing loop. Under the hood, QEMU’s dynamic translation engine (TCG) tracks which blocks/edges run, and LibAFL hooks that coverage directly into its observer maps so your fuzzer can make smart, coverage-guided decisions.
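To make the coverage idea concrete, here is a minimal sketch of a classic AFL-style edge map update. It is an illustration only: `EdgeMap`, `MAP_SIZE`, and `on_block` are hypothetical names, and LibAFL's QEMU edge modules compute their edge indices in their own way.

```rust
/// Hypothetical map size (a power of two so we can mask instead of modulo).
const MAP_SIZE: usize = 1 << 16;

/// Illustrative edge-coverage map in the classic AFL style.
pub struct EdgeMap {
    pub map: Vec<u8>,
    prev_loc: usize,
}

impl EdgeMap {
    pub fn new() -> Self {
        Self { map: vec![0; MAP_SIZE], prev_loc: 0 }
    }

    /// Called once per executed basic block, identified by its guest address.
    pub fn on_block(&mut self, block_addr: u64) {
        // Fold the guest address into the map's index range.
        let cur_loc = ((block_addr >> 4) ^ (block_addr << 8)) as usize & (MAP_SIZE - 1);
        // The edge index mixes the previous and the current block.
        let idx = cur_loc ^ self.prev_loc;
        self.map[idx] = self.map[idx].wrapping_add(1);
        // Shifting prev_loc keeps the edge A->B distinct from B->A.
        self.prev_loc = cur_loc >> 1;
    }

    /// Clear the map between test cases so each run starts empty.
    pub fn reset(&mut self) {
        self.map.iter_mut().for_each(|b| *b = 0);
        self.prev_loc = 0;
    }

    /// Number of distinct edges hit in the current run.
    pub fn edges_hit(&self) -> usize {
        self.map.iter().filter(|&&b| b != 0).count()
    }
}
```

The fuzzer only ever looks at this byte array after a run, which is why keeping the emulator alive and resetting the map is so much cheaper than restarting QEMU per input.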

Think of the LibAFL core as: corpus + mutation + coverage feedback + scheduling + fuzz loop.

Corpus (where inputs are stored)

Corpus: trait for “a place to store testcases”.
InMemoryOnDiskCorpus: keeps metadata in RAM but stores actual inputs on disk (fast + persistent).
OnDiskCorpus: pure on-disk storage (commonly used for crashes/).

Events + monitor output

SimpleEventManager: handles communication between fuzzer components; for single-process fuzzing it’s the simplest option.
SimpleMonitor: prints stats (execs/sec, corpus size, crashes).

Inputs

BytesInput: the common “raw bytes” input type (like a file).
HasTargetBytes: a trait to access the bytes inside an input (so your harness can read them).

Observers (what you measure during execution)

VariableMapObserver: wraps a raw coverage map buffer (“edges map”).
HitcountsMapObserver: converts a raw bitmap into hitcount buckets (better feedback).
TimeObserver: measures execution time of each run.
CanTrack: allows tracking changed indices in a map (useful for minimization/scheduling).
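The hitcount idea is easier to see in code. The sketch below shows AFL-style bucketing of raw 8-bit counters, which is the concept behind hitcount observers; `bucket` and `bucketize` are hypothetical helper names, not LibAFL functions.

```rust
/// Map a raw 8-bit hit counter to an AFL-style bucket.
/// Small counts stay distinct, large counts collapse into coarse classes.
pub fn bucket(count: u8) -> u8 {
    match count {
        0 => 0,
        1 => 1,
        2 => 2,
        3 => 4,
        4..=7 => 8,
        8..=15 => 16,
        16..=31 => 32,
        32..=127 => 64,
        128..=255 => 128,
    }
}

/// Bucketize a whole coverage map in place after a run.
pub fn bucketize(map: &mut [u8]) {
    for entry in map.iter_mut() {
        *entry = bucket(*entry);
    }
}
```

With bucketing, a loop that runs 5 times and one that runs 6 times look the same, so the fuzzer does not flood the corpus with inputs that differ only by tiny loop-count changes.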

Feedbacks (how you decide an input is “interesting”)

MaxMapFeedback: classic coverage feedback (“did this input increase coverage?”).
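TimeFeedback: never marks an input as interesting on its own; it records each testcase’s execution time as metadata so time-aware schedulers can use it (which is why it is combined with coverage via OR).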
CrashFeedback: marks crashes as objectives.
TimeoutFeedback: marks hangs/timeouts as objectives.

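feedback_or / feedback_or_fast: combine feedbacks (e.g., coverage OR time; crash OR timeout). The _fast variant short-circuits, so it stops evaluating as soon as one feedback reports the input as interesting.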
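As a rough mental model, a "max map" coverage feedback and a crash-or-timeout objective can be sketched as follows. This is a simplified illustration under my own assumptions: `MaxMapHistory`, `RunOutcome`, and `is_objective` are hypothetical names, and the real LibAFL types carry far more state and metadata.

```rust
/// Illustrative "max map" feedback: remembers the highest value seen per map entry.
pub struct MaxMapHistory {
    history: Vec<u8>,
}

impl MaxMapHistory {
    pub fn new(size: usize) -> Self {
        Self { history: vec![0; size] }
    }

    /// Returns true if `map` exceeds the history anywhere, and updates the history.
    pub fn is_interesting(&mut self, map: &[u8]) -> bool {
        let mut interesting = false;
        for (seen, &now) in self.history.iter_mut().zip(map) {
            if now > *seen {
                *seen = now;
                interesting = true;
            }
        }
        interesting
    }
}

/// Hypothetical outcome of a single execution.
pub enum RunOutcome {
    Ok,
    Crash,
    Timeout,
}

/// Objective in the "crash OR timeout" style: findings go to the solutions corpus.
pub fn is_objective(outcome: RunOutcome) -> bool {
    matches!(outcome, RunOutcome::Crash | RunOutcome::Timeout)
}
```

The split matters: feedback decides what goes into the working corpus for further mutation, while the objective decides what gets saved as a finding.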

Scheduling + stages + mutators

QueueScheduler: basic FIFO scheduling.
IndexesLenTimeMinimizerScheduler: wrapper that tends to prefer smaller/faster inputs when coverage is comparable (helps keep corpus efficient).
havoc_mutations: the standard mutation set (bit flips, byte flips, insert/delete, etc.).
HavocScheduledMutator: applies those mutations.
StdMutationalStage: a “stage” that mutates and executes inputs.
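To give a feel for what a havoc-style mutator does, here is a tiny std-only sketch. It is not LibAFL's havoc_mutations implementation: `XorShift`, `havoc_once`, and `havoc` are hypothetical names, and the real mutation set is much larger.

```rust
/// Tiny xorshift64 RNG so the sketch needs no external crates.
pub struct XorShift(u64);

impl XorShift {
    pub fn new(seed: u64) -> Self {
        // The xorshift state must never be zero.
        Self(seed.max(1))
    }

    pub fn next_u64(&mut self) -> u64 {
        self.0 ^= self.0 << 13;
        self.0 ^= self.0 >> 7;
        self.0 ^= self.0 << 17;
        self.0
    }

    /// Value in 0..n (n must be non-zero).
    pub fn below(&mut self, n: usize) -> usize {
        (self.next_u64() % n as u64) as usize
    }
}

/// Apply one randomly chosen mutation to the input.
pub fn havoc_once(input: &mut Vec<u8>, rng: &mut XorShift, max_len: usize) {
    match rng.below(4) {
        // Flip a single bit.
        0 if !input.is_empty() => {
            let i = rng.below(input.len());
            let bit = 1u8 << rng.below(8);
            input[i] ^= bit;
        }
        // Overwrite one byte with a random value.
        1 if !input.is_empty() => {
            let i = rng.below(input.len());
            input[i] = rng.next_u64() as u8;
        }
        // Insert a random byte, respecting the size cap.
        2 if input.len() < max_len => {
            let i = rng.below(input.len() + 1);
            let byte = rng.next_u64() as u8;
            input.insert(i, byte);
        }
        // Delete one byte, but never empty the input completely.
        3 if input.len() > 1 => {
            let i = rng.below(input.len());
            input.remove(i);
        }
        // The chosen mutation did not apply; leave the input unchanged.
        _ => {}
    }
}

/// Stack between 1 and 8 random mutations, as havoc-style mutators do.
pub fn havoc(input: &mut Vec<u8>, rng: &mut XorShift, max_len: usize) {
    let rounds = 1 + rng.below(8);
    for _ in 0..rounds {
        havoc_once(input, rng, max_len);
    }
}
```

A mutational stage is then just a loop: pick a corpus entry, clone it, call something like `havoc`, execute the result, and let the feedback decide whether to keep it.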

State + fuzzer wrapper

StdState: stores RNG, corpus, solutions, and feedback state.
HasCorpus: trait to access corpus from state.
StdFuzzer: a ready-made fuzzer pipeline built from scheduler + feedback + objective.
Fuzzer: trait implemented by fuzzers (so you can call fuzz_loop).
Error: LibAFL’s error type.

libafl_qemu is the part that makes “QEMU mode fuzzing” happen.

ELF symbol resolution

EasyElf: parses the target ELF so you can resolve symbols like LLVMFuzzerTestOneInput and compute addresses correctly (often with load base).
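The "load base" detail trips people up, so here is a minimal sketch of the address arithmetic. It assumes a pre-parsed symbol table; `resolve` and its parameters are hypothetical and only mirror the idea of resolving a symbol against a load address, not the actual EasyElf code.

```rust
/// Resolve a symbol name to a runtime guest address.
///
/// `symbols` is a hypothetical pre-parsed table of (name, symbol value) pairs.
/// For position-independent binaries the symbol value is an offset that must be
/// added to the load base; for fixed-address binaries it is already absolute.
pub fn resolve(
    symbols: &[(&str, u64)],
    name: &str,
    load_base: u64,
    is_pie: bool,
) -> Option<u64> {
    let &(_, value) = symbols.iter().find(|(n, _)| *n == name)?;
    // A value of 0 means the symbol is undefined in this file (e.g. an import).
    if value == 0 {
        return None;
    }
    if is_pie {
        load_base.checked_add(value)
    } else {
        Some(value)
    }
}
```

If the resolved address looks like a small offset (for example 0x1a40) rather than something inside the mapped image, the load base was probably not applied.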

Coverage modules

EdgeCoverageModule / StdEdgeCoverageModule: QEMU-side coverage instrumentation that fills your edges map.
EdgeCoverageFullVariant: the “flavor” of edge coverage you’re using (full variant usually means richer coverage tracking).

Filters (optional optimization)

NopAddressFilter / NopPageFilter: “no filtering” filters (instrument everything).
Later you can swap these for real filters to ignore library code or reduce noise.
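Conceptually, an address filter is just a predicate over guest addresses that decides whether a block gets instrumented. The sketch below shows that idea with a hypothetical `AddressFilter` enum; it is not the libafl_qemu filter API.

```rust
use std::ops::Range;

/// Hypothetical address filter deciding which guest code gets instrumented.
pub enum AddressFilter {
    /// Instrument everything (the "Nop" behaviour).
    None,
    /// Instrument only addresses inside one of these ranges.
    AllowList(Vec<Range<u64>>),
    /// Instrument everything except these ranges.
    DenyList(Vec<Range<u64>>),
}

impl AddressFilter {
    /// Returns true if code at `addr` should produce coverage.
    pub fn allowed(&self, addr: u64) -> bool {
        match self {
            AddressFilter::None => true,
            AddressFilter::AllowList(ranges) => ranges.iter().any(|r| r.contains(&addr)),
            AddressFilter::DenyList(ranges) => !ranges.iter().any(|r| r.contains(&addr)),
        }
    }
}
```

An allow list covering only the target library's mapped range is a common way to keep libc and loader code out of the coverage map.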

Core QEMU types

Emulator: constructs and configures the QEMU instance + modules.
Qemu: handle to the running QEMU engine (read/write memory, set breakpoints, run).
QemuExecutor: the LibAFL “executor” that bridges harness execution into LibAFL.
QemuExitReason: why execution stopped (breakpoint, crash, etc.).
TargetSignalHandling: how QEMU treats signals/crashes (e.g., return to harness).
Regs: register identifiers (PC/SP/LR/args depending on arch).
GuestAddr / GuestReg: types for guest addresses and registers.
MmapPerms: memory permissions for guest mappings (RWX).

Arch helpers

ArchExtras: architecture-specific conveniences (AArch64 calling conventions, etc.).

For each input, the harness closure does the following:

Cap the input size (1 MiB) so you don’t write past the mapped buffer.
Write the bytes to guest memory.
Restore the execution context (PC/SP/RA).
Set `arg0=data_ptr`, `arg1=size` following the guest ABI.
Run QEMU until:
we hit the return breakpoint → OK
we crash → Crash
anything else → treat as OK (you can refine this later)
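The steps above can be sketched without an emulator by hiding the guest behind a small trait. Everything here is a stand-in under my own assumptions: `Guest`, `Context`, `StopReason`, `Outcome`, `run_one`, and `MockGuest` are hypothetical names, not libafl_qemu types.

```rust
/// Cap matching the size of the mapped guest input buffer (1 MiB).
pub const MAX_INPUT_SIZE: usize = 1024 * 1024;

#[derive(Debug, PartialEq, Eq, Clone, Copy)]
pub enum StopReason {
    Breakpoint(u64),
    Crash,
    Other,
}

#[derive(Debug, PartialEq, Eq)]
pub enum Outcome {
    Ok,
    Crash,
}

/// Hypothetical stand-in for the emulator operations the harness needs.
pub trait Guest {
    fn write_mem(&mut self, addr: u64, bytes: &[u8]);
    fn restore_context(&mut self, pc: u64, sp: u64, ret: u64);
    fn set_args(&mut self, arg0: u64, arg1: u64);
    fn run(&mut self) -> StopReason;
}

/// Values snapshotted once before fuzzing starts.
pub struct Context {
    pub entry_pc: u64,
    pub stack_ptr: u64,
    pub ret_addr: u64,
    pub input_addr: u64,
}

pub fn run_one<G: Guest>(guest: &mut G, ctx: &Context, input: &[u8]) -> Outcome {
    // 1. Cap the input so we never write past the mapped buffer.
    let len = input.len().min(MAX_INPUT_SIZE);
    // 2. Copy the (possibly truncated) input into guest memory.
    guest.write_mem(ctx.input_addr, &input[..len]);
    // 3. Restore the execution context so every run starts clean.
    guest.restore_context(ctx.entry_pc, ctx.stack_ptr, ctx.ret_addr);
    // 4. Pass (data_ptr, size) as the first two arguments.
    guest.set_args(ctx.input_addr, len as u64);
    // 5. Run and classify why the emulator stopped.
    match guest.run() {
        StopReason::Breakpoint(pc) if pc == ctx.ret_addr => Outcome::Ok,
        StopReason::Crash => Outcome::Crash,
        // Anything else is treated as OK for now; refine this later.
        _ => Outcome::Ok,
    }
}

/// Mock guest used only to exercise the sketch without an emulator.
pub struct MockGuest {
    pub written: usize,
    pub args: (u64, u64),
    pub next_stop: StopReason,
}

impl Guest for MockGuest {
    fn write_mem(&mut self, _addr: u64, bytes: &[u8]) {
        self.written = bytes.len();
    }
    fn restore_context(&mut self, _pc: u64, _sp: u64, _ret: u64) {}
    fn set_args(&mut self, arg0: u64, arg1: u64) {
        self.args = (arg0, arg1);
    }
    fn run(&mut self) -> StopReason {
        self.next_stop
    }
}
```

Keeping the classification in one `match` makes it easy to later add cases such as timeouts or unexpected breakpoints.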

If your fuzzing loop “doesn’t do anything”, this is the first place I look:

are we writing input to the right place?
are we setting PC correctly?
are we actually stopping at the right return address?

In Cargo.toml, the Crates and Dependencies section defines the core building blocks used by the fuzzer:

clap (with the derive feature): creates a clean command-line interface for options like target path, corpus directories, and timeouts.
log and env_logger: provide lightweight logging so you can enable useful runtime output with RUST_LOG=info without hardcoding print statements.
libafl: the main fuzzing logic, providing the framework pieces like corpus management, schedulers, mutators, observers, feedback, and the fuzz loop itself.
libafl_bolts: shared utilities (randomness, tuple helpers, safe-ish map wrappers, and better backtraces via errors_backtrace) that LibAFL uses across components.
libafl_qemu (enabled with usermode): the key dependency that embeds QEMU user-mode emulation into LibAFL so the fuzzer can execute foreign-architecture binaries while collecting coverage.
libafl_targets: the standard shared coverage map layouts and target-side helpers that integrate neatly with LibAFL’s observers and feedback system.

In this blog post, we built a simple and practical workflow for fuzzing AArch64 Android binaries using LibAFL in QEMU user-mode. We started by understanding how LibAFL works with its modular design and how QEMU helps us run binaries from a different architecture while still getting useful coverage feedback. Then we brought everything together—writing a basic harness, targeting a custom native library, and connecting the core LibAFL components that make the fuzzing loop run.

After getting the fuzzer running, we focused on what really matters: analyzing the results. By going through the objectives and replaying crashes using the harness, we were able to verify and better understand the issues we found. This gives us a complete, end-to-end setup—from taking an Android .so file to actually finding and confirming crashes—without needing to run a full Android system.

In the next blog post, we’ll go one step further by fuzzing a real-world Android library using LibAFL in Frida mode, where we can work directly with live processes and explore deeper, more realistic execution paths.


