
Direction Estimation of Sound Sources Using Microphone Arrays and Signal Strength

Pour, Habibzadeh
Sound-tracking refers to the process of determining the direction from which a sound originates, making it a fundamental component of sound source localization. This capability is essential in a variety of applications, including security systems, acoustic monitoring, and speaker tracking, where accurately identifying the direction of a sound source enables real-time responses, efficient resource allocation, and improved situational awareness. While sound-tracking is closely related to localization, it specifically focuses on identifying the direction of the sound source rather than estimating its exact position in space. Despite its utility, sound-tracking systems face several challenges, such as maintaining directional accuracy and precision, along with the need for sophisticated hardware configurations and complex signal processing algorithms. This paper presents a sound-tracking method using three electret microphones. We estimate the direction of a sound source using a lightweight method that analyzes signals from three strategically placed microphones. By comparing the average power of the received signals, the system infers the most probable direction of the sound. The results indicate that the power level from each microphone effectively determines the sound source direction. Our system employs a straightforward and cost-effective hardware design, ensuring simplicity and affordability in implementation. It achieves a localization error of less than 6 degrees and a precision of 98%. Additionally, its effortless integration with various systems makes it versatile and adaptable. Consequently, this technique presents a robust and reliable solution for sound-tracking and localization, with potential applications spanning diverse domains such as security systems, smart homes, and acoustic monitoring.

Basic Information

  • Paper ID: 2507.03466
  • Title: Direction Estimation of Sound Sources Using Microphone Arrays and Signal Strength
  • Authors: Mahdi Ali Pour (Sabancı University), Zahra Habibzadeh (University of Tehran)
  • Classification: cs.SD cs.SY eess.AS eess.SY
  • Publication Date: October 10, 2025 (arXiv version)
  • Paper Link: https://arxiv.org/abs/2507.03466
  • Code Link: https://github.com/mahdi943/soundlocalization

Abstract

This paper proposes a sound source direction estimation method based on three electret microphones. The method infers the most probable direction of a sound source by analyzing signals received by three strategically positioned microphones using a lightweight algorithm that compares the average power of the signals. The system employs simple and cost-effective hardware design, achieving localization error below 6 degrees and 98% accuracy. This technique has broad application potential in security systems, smart homes, and acoustic monitoring.

Research Background and Motivation

1. Problem Statement

This research addresses the sound-tracking problem: determining the direction from which a sound originates. Unlike full sound source localization, sound-tracking identifies only the direction of the source, not its precise position in space.

2. Problem Significance

Sound source direction estimation is critical in multiple application domains:

  • Smart Home Systems: Amazon Alexa and Google Assistant utilize this capability to determine user location within a room
  • Security Monitoring: Real-time response and resource allocation
  • Robotics: Search and rescue operations and autonomous navigation
  • Acoustic Monitoring: Environmental noise source tracking

3. Limitations of Existing Methods

Traditional sound source localization methods suffer from the following issues:

  • Hardware Complexity: Requires large microphone arrays and high-end processors
  • Computational Complexity: Methods such as time difference of arrival (TDOA) and beamforming are computationally intensive
  • High Cost: Complex signal processing algorithms and hardware configurations
  • Poor Real-time Performance: Difficulty meeting real-time application requirements

4. Research Motivation

The authors aim to develop a lightweight, cost-effective alternative using minimal hardware components and simple signal processing algorithms, suitable for resource-constrained environments and rapid deployment scenarios.

Core Contributions

  1. Proposed a lightweight sound source direction estimation method based on signal strength: Achieves accurate direction detection using three microphones and a vector summation algorithm
  2. Designed a low-cost hardware system: Based on Arduino and electret microphones, significantly reducing implementation costs
  3. Achieved high-precision performance: Localization error below 6 degrees with 98% accuracy
  4. Verified practical applicability: Demonstrated method effectiveness and reliability in real-world testing
  5. Provided open-source implementation: Released complete code and hardware design specifications

Methodology Details

Task Definition

  • Input: Sound signals received by three microphones
  • Output: Sound source direction angle and signal strength
  • Constraints: Real-time processing, low-cost hardware, simple algorithms

Model Architecture

1. Hardware Configuration

  • Microphone Placement: Three electret microphones positioned at 0°, 120°, and 240°
  • Distance Setup: Each microphone 15 cm from the center
  • Signal Amplification: LM358 operational amplifier for signal enhancement
  • Controller: Arduino Uno R3 microcontroller
  • Indicator: Servo motor pointing toward detected sound source direction

2. Algorithm Flow

The core algorithm is based on vector summation principles:

Algorithm 1: Sound Localization Algorithm
Input: Sound signal S, number of samples N, threshold T
Output: Sound angle, sound magnitude, servo motor position

if S > T then
  foreach microphone Mᵢ do
    Arrayᵢ ← Collect N samples from S
    Avgᵢ ← Average of Arrayᵢ
    Vᵢ ← Polar vector from Avgᵢ
    Rᵢ ← Rectangular form of Vᵢ
  end
  X ← Σ Rᵢ[0]  // Sum of x-components
  Y ← Σ Rᵢ[1]  // Sum of y-components
  ResultMag ← √(X² + Y²)
  ResultAngle ← atan2(Y, X)
  Servo ← ServoPos
end
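Algorithm 1 can be sketched in a few lines of Python. This is an illustrative reading, not the authors' Arduino code: the function name `estimate_direction`, the input format (one list of non-negative strength samples per microphone), and the rule that gates on the loudest microphone's average are all assumptions. The microphone bearings are the 0°, 120°, and 240° placements from the hardware configuration.

```python
import math

# Microphone bearings in degrees, taken from the hardware configuration.
MIC_ANGLES_DEG = (0.0, 120.0, 240.0)


def estimate_direction(samples_per_mic, threshold, mic_angles_deg=MIC_ANGLES_DEG):
    """Return (angle_deg, magnitude), or None when the signal is too weak.

    samples_per_mic: one sequence of non-negative strength samples per
    microphone (hypothetical input format).
    """
    # Average signal strength per microphone (Avg_i in Algorithm 1).
    averages = [sum(s) / len(s) for s in samples_per_mic]

    # Assumed gating rule: ignore the frame unless some microphone
    # exceeds the threshold.
    if max(averages) <= threshold:
        return None

    # Polar (bearing, average) -> rectangular components, summed over microphones.
    x = sum(a * math.cos(math.radians(t)) for a, t in zip(averages, mic_angles_deg))
    y = sum(a * math.sin(math.radians(t)) for a, t in zip(averages, mic_angles_deg))

    magnitude = math.hypot(x, y)
    angle_deg = math.degrees(math.atan2(y, x)) % 360.0  # map into [0, 360)
    return angle_deg, magnitude
```

For instance, `estimate_direction([[1, 1], [3, 3], [1, 1]], threshold=0.5)` returns a bearing of about 120° with magnitude 2, because the equal part of the three averages cancels in the vector sum and only the excess at the 120° microphone remains.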

3. Mathematical Model

Step 1: Polar Coordinate Representation

Signals from the three microphones are represented as polar coordinate vectors of the form (angle, average power):

  • α = (120°, PowerAvg₁) (1)
  • β = (0°, PowerAvg₂) (2)
  • γ = (240°, PowerAvg₃) (3)

Step 2: Rectangular Coordinate Conversion

  • R₁ = (rα · cos(θα), rα · sin(θα)) (4)
  • R₂ = (rβ · cos(θβ), rβ · sin(θβ)) (5)
  • R₃ = (rγ · cos(θγ), rγ · sin(θγ)) (6)

Step 3: Vector Summation

  • X = Σᵢ₌₁³ Rxᵢ (7)
  • Y = Σᵢ₌₁³ Ryᵢ (8)

Step 4: Result Calculation

  • ResultMag = √(X² + Y²) (9)
  • ResultAngle = atan2(Y, X) (10)
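A short worked example may help make Equations (1) to (10) concrete. The power values below are hypothetical, chosen only for easy arithmetic: PowerAvg₁ = 2 at 120°, PowerAvg₂ = 1 at 0°, and PowerAvg₃ = 1 at 240°.

```latex
\begin{aligned}
X &= 2\cos 120^\circ + 1\cos 0^\circ + 1\cos 240^\circ = -1 + 1 - 0.5 = -0.5 \\
Y &= 2\sin 120^\circ + 1\sin 0^\circ + 1\sin 240^\circ
   = \sqrt{3} + 0 - \tfrac{\sqrt{3}}{2} = \tfrac{\sqrt{3}}{2} \approx 0.866 \\
\text{ResultMag} &= \sqrt{X^2 + Y^2} = \sqrt{0.25 + 0.75} = 1 \\
\text{ResultAngle} &= \operatorname{atan2}(Y, X) = \operatorname{atan2}(0.866,\, -0.5) = 120^\circ
\end{aligned}
```

The component common to all three microphones sums to zero because the bearings are spaced 120° apart, so the result points toward the microphone with the excess power.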

Technical Innovations

  1. Simplified Vector Method: Avoids complex time delay estimation and phase calculations
  2. Signal Strength Comparison Strategy: Directly utilizes signal power rather than time differences
  3. Real-time Processing Capability: Real-time response at a low sampling rate (50 Hz)
  4. Cost-Optimized Design: Uses inexpensive off-the-shelf components

Experimental Setup

Dataset

  • Audio Source: 2-second audio clips played from a smartphone
  • Test Distance: Sound source 35 cm from system center
  • Test Angles: Two target angles of 20° and 120°
  • Experimental Runs: 30 tests per angle, 60 total experiments

Evaluation Metrics

  1. Accuracy: Average deviation between estimated and true angles
  2. Precision: Standard deviation of measurement results
  3. Percentage Accuracy: Precision percentage based on 360° full range
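The first two metrics can be sketched as follows. The helper names `angular_error_deg` and `summarize` are hypothetical, and using the sample standard deviation is an assumption, since the summary does not say which estimator the authors used. The percentage metric is left out because its exact formula is not given here.

```python
import statistics


def angular_error_deg(estimate_deg, target_deg):
    """Smallest absolute difference between two bearings, in degrees (0 to 180)."""
    diff = (estimate_deg - target_deg + 180.0) % 360.0 - 180.0
    return abs(diff)


def summarize(estimates_deg, target_deg):
    """Return (accuracy, precision) for repeated estimates of one target angle."""
    errors = [angular_error_deg(e, target_deg) for e in estimates_deg]
    accuracy = statistics.mean(errors)           # average deviation from the true angle
    precision = statistics.stdev(estimates_deg)  # spread of the measurements (sample std)
    return accuracy, precision
```

Wrapping the difference into the range from -180° to 180° keeps an estimate of 350° against a target of 10° at an error of 20° rather than 340°. The plain standard deviation assumes the estimates do not straddle the 0°/360° boundary.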

Implementation Details

  • Sampling Strategy: A fixed number of samples (N) collected per microphone
  • Threshold Setting: Predefined signal strength threshold
  • Data Processing: 6% trimming to remove outliers
  • Visualization: Polar scatter plots displaying results
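One possible reading of the 6% trimming step is a symmetric trim that drops the lowest and highest 6% of the measurements. The sketch below assumes that interpretation and assumes the per-tail count is rounded to the nearest integer; the function name `trim_outliers` is hypothetical. Under these assumptions, 30 runs lose 2 values per tail and 26 remain.

```python
def trim_outliers(values, fraction=0.06):
    """Drop the lowest and highest `fraction` of values (count rounded per tail)."""
    ordered = sorted(values)
    k = round(fraction * len(ordered))  # assumed rounding rule; 0.06 * 30 = 1.8 -> 2
    if k == 0:
        return ordered
    return ordered[k:len(ordered) - k]
```

Other trimming rules, such as discarding the runs with the largest angular error, are equally plausible and would need to be checked against the released code.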

Experimental Results

Main Results

Metric                  120°     20°
Test Runs               30       30
Data After Trimming     26       26
Accuracy (degrees)      5.26     7.11
Precision (degrees)     3.26     4.01
Percentage Accuracy     98.9%    98.8%

Detailed Analysis

  1. 120° Test Results:
    • Average Error: 5.26°
    • Standard Deviation: 3.26°
    • Accuracy: 98.9%
  2. 20° Test Results:
    • Average Error: 7.11°
    • Standard Deviation: 4.01°
    • Accuracy: 98.8%

Visualization Results

Experimental results are displayed through polar scatter plots, showing:

  • Concentrated vector distribution in the target direction
  • Signal strength variations reflected in distance from the origin
  • Minor deviations caused by environmental noise and microphone sensitivity differences

Experimental Findings

  1. Good Consistency: Both test angles demonstrate high consistency in direction estimation
  2. Stable Accuracy: Over 98% accuracy validates method reliability
  3. Real-time Performance: System responds to sound source changes in real time
  4. Hardware Limitations: Low sampling rate requires sound sources to be relatively close

Related Work

Traditional Sound Source Localization (SSL) Methods

  1. TDOA Method: Calculates time-of-arrival differences of signals between microphones
  2. Phase Techniques: Utilizes phase shifts of audio signals between sensors
  3. Beamforming: Uses microphone arrays based on time signal alignment to estimate direction
  4. Bayesian Filtering: Improves accuracy but increases complexity
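For contrast with the signal-strength approach, the core of a TDOA estimate can be sketched as a brute-force cross-correlation between two microphone signals. This is a generic textbook illustration rather than anything from the paper; the function name `estimate_delay_samples` and its parameters are hypothetical.

```python
def estimate_delay_samples(reference, delayed, max_lag):
    """Return the lag (in samples) that maximizes the cross-correlation.

    A positive result means `delayed` lags behind `reference`.
    """
    best_lag, best_score = 0, float("-inf")
    for lag in range(-max_lag, max_lag + 1):
        score = 0.0
        for n, value in enumerate(reference):
            m = n + lag
            if 0 <= m < len(delayed):
                score += value * delayed[m]
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag
```

The nested loop costs on the order of N × (2 · max_lag + 1) multiplications per microphone pair, which illustrates why such methods are heavier than comparing average power. Timing resolution is also tied to the sampling rate: microphones placed 15 cm from the center at 120° spacing are about 26 cm apart, so sound crosses that gap in well under 1 ms, while a 50 Hz sampling rate yields one sample every 20 ms.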

Modern Approaches

  1. Machine Learning Methods: Classify sound direction using raw or preprocessed signal data
  2. Particle Filtering: Real-time tracking in dynamic environments
  3. Signal Strength Comparison: Similar to this work but typically with lower accuracy

Advantages of This Work

Compared to existing methods, this paper's advantages include:

  • Low hardware requirements (three microphones and an Arduino Uno)
  • Low algorithmic complexity (averaging and vector summation)
  • High cost-effectiveness
  • Low deployment difficulty

Conclusions and Discussion

Main Conclusions

  1. Feasibility Verification: Demonstrates the feasibility of accurate direction estimation using three microphones and simple algorithms
  2. Superior Performance: Achieves localization error below 6 degrees with 98% accuracy
  3. Cost-Effectiveness: Provides a low-cost alternative to traditional complex systems
  4. Practical Value: Applicable to multiple real-world scenarios

Limitations

  1. Distance Constraints: Because of the Arduino Uno's low sampling rate, the sound source must be relatively close to the system (35 cm in the experiments)
  2. Environmental Requirements: Robustness in noisy environments requires further verification
  3. Hardware Constraints: Limited by Arduino's memory capacity and processing power
  4. Test Scope: Experiments conducted only in controlled environments, lacking large-scale real-world deployment validation

Future Directions

  1. Hardware Upgrades: Explore microcontrollers with higher sampling rates and stronger processing capabilities
  2. Algorithm Optimization: Improve algorithms to enhance robustness in noisy environments
  3. Extended Testing: Conduct tests in more realistic scenarios and noise conditions
  4. Multi-Sensor Fusion: Integrate additional sensors to improve accuracy

In-Depth Evaluation

Strengths

  1. Moderate Innovation with High Practicality: While technical innovation is limited, it demonstrates significant advantages in cost-effectiveness
  2. Reasonable Experimental Design: Scientific testing methodology with credible results
  3. High Practical Value: Provides feasible solutions for resource-constrained environments
  4. Open-Source Contribution: Provides complete code and hardware designs for reproducibility and improvement

Weaknesses

  1. Limited Technical Depth: Relatively simple algorithms lacking theoretical depth
  2. Limited Test Scope: Testing only under ideal conditions with insufficient real-world scenario validation
  3. Missing Comparative Analysis: Lacks direct comparison with other low-cost methods
  4. Insufficient Robustness Analysis: Limited analysis of handling capabilities for noise and multi-source scenarios

Impact

  1. Educational Value: Provides excellent case studies for teaching and prototype development
  2. Engineering Practicality: Demonstrates good practical value in specific application scenarios
  3. Cost Advantages: Offers feasible solutions for budget-constrained projects
  4. Inspirational Significance: Demonstrates the effectiveness of simple methods under specific conditions

Applicable Scenarios

  1. Educational Projects: University courses and student projects
  2. Prototype Development: Rapid concept verification and prototyping
  3. Smart Homes: Small-scale indoor sound source tracking
  4. Embedded Applications: Resource-constrained embedded systems
  5. DIY Projects: Hobbyist and maker projects

References

The paper cites 28 relevant references covering important works in sound source localization, signal processing, robotics, and other related fields, providing sufficient theoretical foundation and technical background for the research.


Overall Assessment: This is a highly practical engineering paper that, while limited in theoretical innovation, excels in cost-effectiveness and practicality. The work provides a simple, feasible, and cost-efficient solution for sound source direction estimation, particularly suitable for educational applications, prototype development, and resource-constrained scenarios.