How do I monitor audio levels in real time using Java Sound API?

Monitoring audio levels in real time is useful for applications like voice recorders, streaming tools, or any app that displays a volume meter. In Java, this is possible using the javax.sound.sampled package, specifically with the TargetDataLine class.

In this post, you’ll learn how to:

  • Capture audio input from a microphone
  • Convert it into byte data
  • Calculate the current audio level (amplitude)
  • Display the level in real time (console bar graph style)

Step 1: Setup Required Imports

import javax.sound.sampled.*;

Step 2: Open the Microphone (TargetDataLine)

You’ll need to configure and open a TargetDataLine with a supported audio format:

AudioFormat format = new AudioFormat(44100.0f, 16, 1, true, true);
DataLine.Info info = new DataLine.Info(TargetDataLine.class, format);

TargetDataLine line = (TargetDataLine) AudioSystem.getLine(info);
line.open(format);
line.start();

Step 3: Read and Analyze Audio Data in Real Time

We’ll continuously read short chunks of audio and calculate the volume level based on the root-mean-square (RMS) of the signal.

byte[] buffer = new byte[1024];
int bytesRead;

System.out.println("Monitoring audio levels... (Ctrl+C to stop)");

while (true) {
    bytesRead = line.read(buffer, 0, buffer.length);

    // Convert bytes to amplitude
    double sum = 0.0;
    for (int i = 0; i < bytesRead; i += 2) {
        // Convert byte pair to int
        int sample = (buffer[i] << 8) | (buffer[i + 1] & 0xFF);
        sum += sample * sample;
    }

    double rms = Math.sqrt(sum / ((double) bytesRead / 2));
    double db = 20 * Math.log10(rms); // Convert to decibels

    // Visualize as a simple bar
    int level = (int) (db + 50); // Normalize range
    level = Math.max(0, Math.min(50, level));
    System.out.println("[" + "*".repeat(level) + "]");
}

Step 4: Clean Up

You should close the audio line when you’re done:

line.stop();
line.close();

Notes and Tips

  • The audio input format is 44.1 kHz, 16-bit, mono, signed, big-endian. You can change it to suit your needs.
  • The loop runs indefinitely. You may want to run it on a background thread and provide a stop condition.
  • For better GUI visualization, consider integrating with Swing or JavaFX.

Summary

You’ve just created a simple Java program that listens to microphone input and prints real-time audio level feedback. This can be used as the foundation for:

  • Voice activity detection
  • Audio visualizers
  • Mute detection
  • Noise level meters

How do I save the microphone audio as a proper WAF file?

To save the microphone audio as a proper WAV file, you need to use the AudioSystem.write() method. WAV files contain raw PCM data combined with a header that describes important details, such as the sample rate, number of channels, etc. Java’s javax.sound.sampled package makes it easy to save the audio in this format.


Example: Saving Captured Audio as a WAV File

Here’s how you can save audio directly as a WAV file while using TargetDataLine:

package org.kodejava.sound;

import javax.sound.sampled.*;
import java.io.File;
import java.io.IOException;

public class MicrophoneToWav {

    public static void main(String[] args) {
        new MicrophoneToWav().start();
    }

    public void start() {
        // Define the audio format
        AudioFormat audioFormat = new AudioFormat(
                AudioFormat.Encoding.PCM_SIGNED, // Encoding
                44100.0f, // Sample rate (44.1kHz)
                16,       // Sample size in bits
                2,        // Channels (stereo)
                4,        // Frame size (16 bits/sample * 2 channels)
                44100.0f, // Frame rate (matches sample rate for PCM)
                false     // Big-endian (false = little-endian)
        );

        // Get and configure the TargetDataLine
        TargetDataLine microphone;
        try {
            microphone = AudioSystem.getTargetDataLine(audioFormat);
            microphone.open(audioFormat);

            File wavFile = new File("D:/Sound/output.wav");

            // Start capturing audio
            microphone.start();
            System.out.println("Recording started... Press Ctrl+C or stop to terminate.");

            // Set up a shutdown hook for graceful termination
            Runtime.getRuntime().addShutdownHook(new Thread(() -> stop(microphone)));

            // Save the microphone data to a WAV file
            writeAudioToWavFile(microphone, wavFile);

        } catch (LineUnavailableException e) {
            e.printStackTrace();
        }
    }

    private void writeAudioToWavFile(TargetDataLine microphone, File wavFile) {
        try (AudioInputStream audioInputStream = new AudioInputStream(microphone)) {
            // Write the stream to a WAV file
            AudioSystem.write(audioInputStream, AudioFileFormat.Type.WAVE, wavFile);
        } catch (IOException e) {
            e.printStackTrace();
        } finally {
            stop(microphone);
        }
    }

    public void stop(TargetDataLine microphone) {
        if (microphone != null && microphone.isOpen()) {
            microphone.flush();
            microphone.stop();
            microphone.close();
            System.out.println("Microphone stopped.");
        }
    }
}

Explanation

  1. Audio Format:
    • The AudioFormat specifies PCM encoding with a sample rate of 44100 Hz, 16-bit samples, 2 channels (stereo), and little-endian format.
  2. TargetDataLine:
    • A TargetDataLine is used to read audio data from the microphone.
  3. AudioInputStream:
    • The AudioInputStream wraps the TargetDataLine, creating a stream of audio data in chunks.
  4. AudioSystem.write():
    • The AudioSystem.write() method writes the audio stream directly to a .wav file using AudioFileFormat.Type.WAVE.
    • WAV files are chunks of PCM raw data with a proper header. This method handles creating the header for you.
  5. Shutdown Hook:
    • A shutdown hook ensures that resources (like the microphone) are released when the application stops or when the user presses Ctrl+C.
  6. Graceful Stop:
    • The stop() method safely terminates the recording loop and releases resources, such as the TargetDataLine.

How do I capture microphone input using TargetDataLine?

To capture microphone audio input using the TargetDataLine class in Java, you can use the javax.sound.sampled package. Here’s a step-by-step explanation of how you can achieve this:

Steps to Capture Microphone Input

  1. Prepare the Audio Format: Define an AudioFormat object, specifying the audio sample rate, sample size, number of channels, etc.
  2. Get the TargetDataLine: Use AudioSystem to obtain and open a TargetDataLine.
  3. Start Capturing Audio: Begin capturing audio from the TargetDataLine.
  4. Read Data from the Line: Continuously read data from the TargetDataLine into a byte buffer.
  5. (Optional) Save the Data: Write the captured audio data to a file or process it as needed.

Example Code

Below is a complete example of how to capture microphone input using TargetDataLine:

package org.kodejava.sound;

import javax.sound.sampled.*;
import java.io.ByteArrayOutputStream;
import java.io.File;
import java.io.FileOutputStream;
import java.io.IOException;

public class MicrophoneCapture {

    // Volatile flag for ensuring proper thread shutdown
    private volatile boolean running;

    public static void main(String[] args) {
        new MicrophoneCapture().start();
    }

    public void start() {
        // Define the audio format
        AudioFormat audioFormat = new AudioFormat(
                AudioFormat.Encoding.PCM_SIGNED, // Encoding
                44100.0f, // Sample rate (44.1kHz)
                16,       // Sample size in bits
                2,        // Channels (stereo)
                4,        // Frame size (frame size = 16 bits/sample * 2 channels = 4 bytes)
                44100.0f, // Frame rate (matches sample rate for PCM)
                false     // Big-endian (false = little-endian)
        );

        // Get and configure the TargetDataLine
        TargetDataLine microphone;
        try {
            microphone = AudioSystem.getTargetDataLine(audioFormat);
            microphone.open(audioFormat);

            // Start capturing audio
            microphone.start();
            System.out.println("Recording started... Press Ctrl+C or stop to terminate.");

            // Register a shutdown hook for graceful termination
            Runtime.getRuntime().addShutdownHook(new Thread(() -> {
                stop(microphone);
                System.out.println("Recording stopped.");
            }));

            // Start capturing in another thread
            captureMicrophoneAudio(microphone);

        } catch (LineUnavailableException e) {
            e.printStackTrace();
        }
    }

    private void captureMicrophoneAudio(TargetDataLine microphone) {
        byte[] buffer = new byte[4096];
        ByteArrayOutputStream outputStream = new ByteArrayOutputStream();

        running = true;

        // Capture audio in a loop
        try (microphone) {
            while (running) {
                int bytesRead = microphone.read(buffer, 0, buffer.length);
                if (bytesRead > 0) {
                    outputStream.write(buffer, 0, bytesRead);
                }
            }

            // Save captured audio to a raw file
            saveAudioToFile(outputStream.toByteArray(), "D:/Sound/output.raw");

        } catch (Exception e) {
            e.printStackTrace();
        }
    }

    private void saveAudioToFile(byte[] audioData, String fileName) {
        try (FileOutputStream fileOutputStream = new FileOutputStream(new File(fileName))) {
            fileOutputStream.write(audioData);
            System.out.println("Audio saved to " + fileName);
        } catch (IOException e) {
            e.printStackTrace();
        }
    }

    public void stop(TargetDataLine microphone) {
        running = false; // Stop the loop
        if (microphone != null && microphone.isOpen()) {
            microphone.flush();
            microphone.stop();
            microphone.close();
        }
    }
}

Explanation

  1. Audio Format: The AudioFormat object defines the format of the captured audio (e.g., PCM encoding, 44.1 kHz sample rate, 16-bit sample size, stereo channels).
  2. TargetDataLine Setup: TargetDataLine is the primary interface to access audio input lines, such as the microphone. The open() method ensures it’s properly configured with the specified format.
  3. Reading Audio Data: Data from the microphone is captured into a byte[] buffer using the read() method.
  4. Saving the Audio: The audio data can be saved to a file (e.g., .raw for raw PCM data).

Points to Note

  • Permissions: Ensure your application has permission to access the microphone, particularly when running on platforms like macOS or Windows.
  • Audio Processing: If you need further audio processing (e.g., writing to a WAV file), you’ll need to add additional logic to wrap the raw PCM data in a WAV file format header.
  • Thread Safety: For a real-time application, consider running the audio capture logic in a separate thread.