On TechRepublic: 10 ways to survive office politics
BNET Business Network:
BNET
TechRepublic
ZDNet

March 19th, 2008

Fraunhofer IIS shows audio technology at VON.x 2008

Posted by George Ou @ 9:47 am

Categories: ~Events~, VoIP, VON, Audio Conferencing, Video Conferencing

Tags: Microphone, Speaker, Microsoft IIS Server, George Ou

In the world where chip technology improves exponentially, acoustic engineering isn’t so simple and it presents a huge hurdle to overcome to the world of telephony and video conferencing.  Fraunhofer IIS (inventors of MPEG-1 Layer 3 AKA MP3) seeks to tackle this challenge and showed off some of its research and upcoming products at VON.x 2008.

The first demonstration given to me was echo cancellation technology that prevents sound coming out of a speaker from reentering the microphone.  This is one of the most annoying things about using PC telephony like Skype since Skype lacks good echo cancellation technology[update 3/20/2008 - Recent versions of Skype now have very good echo cancellation on Windows, Mac, and Linux.  I still experienced some problems because the clients on the other end were using older versions of Skype.]  When you connect to someone using a speaker and microphone, you can often hear yourself talking a split second after you speak and it’s incredibly annoying.  With Fraunhofer’s echo cancellation technology, that problem virtually disappeared.

Now I’m fully aware that Skype is a free application but that hasn’t stopped open source solutions like Asterisk from offering licensed technology where a user for example pays $10 for the G.729 codec.  I’d gladly pay a little money for some good echo cancellation software.

The other cool demonstration was the discrete multichannel sound separation technology.  Normally when you’re in a room with multiple microphones being mixed in to a single sound channel and transmitted over a single audio channel, the sound is blurred.  But when the sound from each microphone and each person is sent in its own channel and played back from its own speakers, you can clearly hear each person speaking at the same time.  The downside of course is that each audio channel uses a separate 64 Kbps steam but that may not be a problem since it’s dwarfed by the video stream.

If you can’t spare the bandwidth and you only want to use a single 64 Kbps audio stream, Fraunhofer has another technology that can separate each person on to its own channel by marking the streams with few identifiers.  Once that’s done, each person can be moved from one sector to the other in a graphical interface shown below such that their sounds come out from the corresponding speakers.  While it wasn’t as pure as the discrete channel solution, it sounded almost as good because each person’s voice had its own dedicated speaker.  Just the act of using a physically different speaker cone per voice seems to have a huge impact on quality.

While this technology demo used 5 speakers, there’s no reason it can’t be made to work with the more typical stereo speaker set up.  I’d love to see audio conferencing bridges incorporate this technology such that multiple sound sources are marked for separate speakers so that they can be played back from separate speakers.

George Ou is Technical Director of ZDNet. See his full profile and disclosure of his industry affiliations.

  • Talkback
  • Most Recent of 7 Talkback(s)
MP3 is very old so it isn't a fair comparison
MP3 is VERY old so it isn't a fair comparison against a modern codec like OGG.

They showed Fraunhoffer's AAC low delay codec to me and it exhibited very good quality and very good robustness against packet loss.... (Read the rest)
Posted by: georgeou Posted on: 03/19/08 You are currently: Logged In | Log out
Um... Skype does have AEC spark555   | 03/19/08
Like the rest of Skype, it doesn't work well j.a.watson   | 03/19/08
MSN messenger has much better echo cancellation, but horrible codec georgeou   | 03/19/08
yeah, too bad it doesn't work at all georgeou   | 03/19/08
Fraunhofer uses their patents to impede innovation... sdunn2000@...   | 03/19/08
true... and ogg is a far better technology. james.faction   | 03/19/08
MP3 is very old so it isn't a fair comparison georgeou   | 03/19/08

What do you think?

One Trackback

The URI to TrackBack this entry is:
http://blogs.zdnet.com/Ou/wp-trackback.php?p=1066

  • Fraunhofer IIS shows audio technology at VON.x 2008 - ZDNet
    Fraunhofer IIS shows audio technology at VON.x 2008 ZDNet - 1 hour ago This is one of the most annoying things about using PC telephony like Skype since Skype lacks good echo cancellation technology. When you connect to someone … ...

    Trackback by recursosvoip.com news — March 19, 2008 @ 6:10 pm

advertisement

Recent Entries

Top Rated

    advertisement

    Archives

    ZDNet Blogs

    All-in-One Printers

    advertisement
    Click Here