Case ID: M09-118P

Published: 2011-01-25 10:51:45

Last Updated: 1700030365


Harish Krishnamoorthi
Visar Berisha
Andreas Spanias

Technology categories

Physical ScienceWireless & Networking

Licensing Contacts

Shen Yan
Director of Intellectual Property - PS
[email protected]

Sinusoidal Loudness Estimation

In recent years, the proliferation of internet streaming
applications has brought about the need for low-bit rate speech and audio coding
methods. Several parametric designs such as the sinusoids+transients+noise model
have been somewhat successful for speech and audio synthesis. Low-bit rate and
streaming applications are restricted by only being able to transmit a limited
number of parameters for loudness estimation, resulting in reduced audio
quality. Current techniques for selecting relevant parameters are based on
either Signal to Mask Ratio (SMR) or loudness patterns. Despite their
popularity, an SMR focus has a tendency to neglect perceptually relevant
sinusoids, and the loudness focused methods are computationally costly and
aren?t proficient in delay?critical applications.

Researchers at Arizona State University have created a novel
and efficient technique for estimating relevant perceptual quantities like
loudness patterns of individual sinusoids. This routine is helpful for accurate
parameter selection in low-bit rate speech and audio applications.

Potential Applications

  • Speech and Audio applications

    • MPEG-4 HVXC speech coder
    • MPEG-4 HILN audio coder

  • Low-Bit Rate applications

    • Digital Audio Broadcasting
    • Internet Streaming

  • Volume Control
  • Hearing Aid Technologies

Benefits and Advantages

  • 90% faster CPU time for algorithm execution
  • Sufficient computational efficiency for real-time use

  • Less than 1/2 the loudness error of common method as
    number of sinusoidal components selected increases