This is the reference implementation of SoundBubble (Daehwa Kim and Chris Harrison, "SoundBubble: Finger-Bound Virtual Microphone Using Headset/Glasses Beamforming," CHI 2026). SoundBubble combines a microphone array, acoustic beamforming, and hand tracking to capture acoustic signals coming from the hand during interaction, providing useful input sensing in XR.
Find details on our website: https://daehwakim.com/soundbubble
We provide a pipeline to listen to the beamformed audio inside a sound bubble, visualize the signal inputs to our models, and open weights for the swiping model that supports drawing in the world.
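The repo's beamforming pipeline itself is not reproduced here, but the core idea of steering a microphone array at a point (such as the tracked hand) can be sketched as a minimal delay-and-sum beamformer. This is an illustrative sketch with integer-sample delays, not the paper's implementation; the delays would normally be derived from the mic geometry and the focus point.

```python
import numpy as np

def delay_and_sum(frames, delays_samples):
    """Delay-and-sum beamformer: advance each channel by its steering
    delay so signals from the focus point align, then average.

    frames: (n_samples, n_channels) multichannel audio block
    delays_samples: per-channel integer delays in samples (illustrative;
    real delays come from array geometry and the target position)
    """
    n_samples, n_channels = frames.shape
    out = np.zeros(n_samples)
    for ch, d in enumerate(delays_samples):
        out += np.roll(frames[:, ch], -d)  # undo this channel's arrival delay
    return out / n_channels

# Toy example: the same pulse reaches 4 mics with different delays.
pulse = np.zeros(1024)
pulse[100] = 1.0
delays = [0, 3, 7, 12]
frames = np.stack([np.roll(pulse, d) for d in delays], axis=1)

focused = delay_and_sum(frames, delays)          # steered at the source
unfocused = delay_and_sum(frames, [0, 0, 0, 0])  # no steering
print(focused.max(), unfocused.max())  # steered peak is larger
```

When steered correctly, the four copies of the pulse add coherently, while signals from other directions stay misaligned and average down, which is what carves out the "bubble" around the hand.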
Attach the microphone to the Meta Quest 3S and follow the calibration instructions described in our paper.
Python is used for audio processing and machine learning, and Unity for hand tracking.
python3.9 -m venv venv
source venv/bin/activate
pip install -r requirements.txt
export OPENBLAS_NUM_THREADS=1
Install Unity Hub and open SoundBubble_Unity. We used Editor version 6000.0.26f1.
If you are on macOS, change the serverIP (line 14) in SoundBubble_Unity/Assets/Scenes/ArrayReceiver.cs to your laptop's IP address. Then go to File > Build Profiles > Android and build the project to the Meta Quest.
If you are on Windows, change the serverIP (line 14) in SoundBubble_Unity/Assets/Scenes/ArrayReceiver.cs to localhost. Then press the Play button to run the Unity app on the Quest.
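The serverIP setting in ArrayReceiver.cs implies the Unity app receives data from the Python side over a socket. The exact protocol is defined by the repo's scripts; as a rough illustration of how such a link is commonly framed, here is a length-prefixed binary sketch. The 2-byte header and little-endian float32 payload are assumptions for the demo, not the repo's actual format.

```python
import socket
import struct
import threading

def recv_exact(sock, n):
    """Read exactly n bytes from the socket."""
    buf = b""
    while len(buf) < n:
        chunk = sock.recv(n - len(buf))
        if not chunk:
            raise ConnectionError("socket closed")
        buf += chunk
    return buf

def send_frame(sock, values):
    """Send floats as little-endian float32 with a 2-byte length header
    (framing is illustrative only)."""
    payload = struct.pack(f"<{len(values)}f", *values)
    sock.sendall(struct.pack("<H", len(payload)) + payload)

def recv_frame(sock):
    n = struct.unpack("<H", recv_exact(sock, 2))[0]
    return list(struct.unpack(f"<{n // 4}f", recv_exact(sock, n)))

# Loopback demo: a "laptop-side" server and a "headset-side" client.
server = socket.socket()
server.bind(("127.0.0.1", 0))
server.listen(1)
port = server.getsockname()[1]

def client():
    c = socket.create_connection(("127.0.0.1", port))
    send_frame(c, [0.1, 0.2, 0.3])
    c.close()

t = threading.Thread(target=client)
t.start()
conn, _ = server.accept()
received = recv_frame(conn)
print(received)
t.join()
conn.close()
server.close()
```

On a real setup, serverIP simply tells the headset which machine to reach; localhost works when the Quest runs through the editor on the same machine.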
python main.py
The script prints the available audio devices in the terminal and opens the SoundBubble user interface. To activate the interface, select the desired audio input and output. For example:
Available Input (Microphone) Devices:
Index 0: Daehwa’s iPhone Microphone
Input channels: 1
Sample rate: 48000.0
Index 1: UMA16v2
Input channels: 16
Sample rate: 44100.0
Index 2: MacBook Air Microphone
Input channels: 1
Sample rate: 48000.0
Available Output (Playback) Devices:
Index 1: UMA16v2
Output channels: 2
Sample rate: 44100.0
Index 3: MacBook Air Speakers
Output channels: 2
Sample rate: 48000.0
Select microphone device index: 1
Select headphone device index: 3
The input should be the UMA16v2 (our 16-channel microphone array); the output can be any playback device.
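Picking indices by hand each run is easy to get wrong; a small helper can find the array microphone automatically from a device list. The dictionaries below mimic the shape of python-sounddevice's device query results, which is an assumption about the underlying library; the helper itself is hypothetical, not part of main.py.

```python
def pick_input(devices, want_channels=16, name_hint="UMA16v2"):
    """Return the index of the first input device matching the array mic:
    prefer a name match with enough channels, fall back to channel count."""
    for i, d in enumerate(devices):
        if name_hint.lower() in d["name"].lower() and d["max_input_channels"] >= want_channels:
            return i
    for i, d in enumerate(devices):
        if d["max_input_channels"] >= want_channels:
            return i
    raise RuntimeError("16-channel array microphone not found")

# Device list shaped like the terminal output above.
devices = [
    {"name": "Daehwa's iPhone Microphone", "max_input_channels": 1},
    {"name": "UMA16v2", "max_input_channels": 16},
    {"name": "MacBook Air Microphone", "max_input_channels": 1},
]
print(pick_input(devices))  # → 1
```

This matches the example session above, where index 1 (the UMA16v2) is the correct input choice.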
Four panels in the interface show model inputs and predictions.

Use the function keys to trigger different effects in the Unity app, as shown in the Preview of Repo section.



