<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>auphonic.com - Entries for the category Development</title><link>https://auphonic.com/blog/categories/development/</link><description>The last entries categorized under Development</description><atom:link href="https://auphonic.com/blog/feeds/categories/development/" rel="self"/><language>en-us</language><copyright>Zinnia</copyright><lastBuildDate>Thu, 26 Mar 2026 07:04:07 +0000</lastBuildDate><item><title>Introducing the Auphonic Command Line Interface (CLI)
</title><link>https://auphonic.com/blog/2026/03/26/auphonic-cli/</link><description>
&lt;p&gt;
    Auphonic is now available from the command line: &lt;br&gt;
    The &lt;a href="https://auphonic.com/cli" target="_blank"&gt;Auphonic CLI&lt;/a&gt; lets you
    process, manage, and automate &lt;a href="https://auphonic.com/help/web/production.html" target="_blank"&gt;audio productions&lt;/a&gt;
    without leaving the terminal.
    It's a &lt;b&gt;free, single binary with no dependencies&lt;/b&gt; &amp;mdash;
    just download it, authenticate, and start processing.
  &lt;/p&gt;

  &lt;img src="https://auphonic.com/help/_images/cli-terminal.png" width="608" height="240" alt="Auphonic CLI Terminal" /&gt;

  &lt;p&gt;
    Everything you know from Auphonic is available:
    &lt;a href="https://auphonic.com/help/algorithms/singletrack.html#noise-and-reverb-reduction" target="_blank"&gt;Noise Reduction&lt;/a&gt;,
    &lt;a href="https://auphonic.com/help/algorithms/singletrack.html#loudness-specifications-and-true-peak-limiter" target="_blank"&gt;Loudness Normalization&lt;/a&gt;,
    &lt;a href="https://auphonic.com/help/algorithms/speech_recognition.html" target="_blank"&gt;Speech Recognition&lt;/a&gt;,
    &lt;a href="https://auphonic.com/help/algorithms/multitrack.html" target="_blank"&gt;Multitrack Processing&lt;/a&gt;,
    &lt;a href="https://auphonic.com/help/web/preset.html" target="_blank"&gt;Presets&lt;/a&gt;,
    and publishing to
    &lt;a href="https://auphonic.com/help/web/services.html" target="_blank"&gt;External Services&lt;/a&gt;.
    All from a single command.
  &lt;/p&gt;


  &lt;h3&gt;Use Cases&lt;/h3&gt;

  &lt;p&gt;
    Whether you're producing a single episode or processing thousands of files,
    the CLI fits into a wide range of workflows:
  &lt;/p&gt;

  &lt;p&gt;
    &lt;strong&gt;File-based workflows:&lt;/strong&gt;
    If you're a broadcaster, podcaster, or anyone working with file-based audio,
    the CLI lets you integrate Auphonic directly into your local workflow.
    Drop it into your existing production pipeline and process files
    right where they live &amp;mdash; no browser required.
  &lt;/p&gt;

  &lt;p&gt;
    &lt;strong&gt;DAW integration:&lt;/strong&gt;
    Pipe audio straight from ffmpeg, sox, or your DAW exports into Auphonic.
    This makes it easy to add post-processing as the final step
    in your editing workflow without switching tools.
  &lt;/p&gt;

  &lt;p&gt;
    &lt;strong&gt;Batch processing &amp;amp; scripting:&lt;/strong&gt;
    Wrap the CLI in a shell script, cron job, or CI/CD pipeline
    to process entire folders of files automatically.
    Great for teams that need to handle high volumes of audio
    with consistent quality settings.
  &lt;/p&gt;

  &lt;p&gt;
    &lt;strong&gt;Optimized for humans and agents:&lt;/strong&gt;
    A single command replaces what would otherwise be multiple API calls
    with HTTP requests, JSON payloads, and auth headers.
    The CLI is designed to be easy to use interactively,
    but its structured output also makes it a natural fit for
    AI agents and LLMs that need to integrate audio processing into their toolchains.
  &lt;/p&gt;


  &lt;h3&gt;Examples&lt;/h3&gt;

  &lt;p&gt;
    Processing a file is as simple as a single command.
    The CLI uploads your audio to Auphonic, applies our
    &lt;a href="https://auphonic.com/help/algorithms/singletrack.html" target="_blank"&gt;default audio algorithms&lt;/a&gt;
    automatically, and downloads the processed result back to your machine:
  &lt;/p&gt;

  &lt;div style="background: #111827; border-radius: 8px; overflow: hidden; margin: 1em auto; max-width: 680px;"&gt;
    &lt;div style="display: flex; align-items: center; gap: 6px; padding: 8px 16px; background: rgba(31, 41, 55, 0.6);"&gt;
      &lt;span style="width: 12px; height: 12px; border-radius: 50%; background: rgba(239, 68, 68, 0.8);"&gt;&lt;/span&gt;
      &lt;span style="width: 12px; height: 12px; border-radius: 50%; background: rgba(234, 179, 8, 0.8);"&gt;&lt;/span&gt;
      &lt;span style="width: 12px; height: 12px; border-radius: 50%; background: rgba(34, 197, 94, 0.8);"&gt;&lt;/span&gt;
      &lt;span style="margin-left: 8px; font-size: 12px; color: #6b7280; font-family: monospace;"&gt;Terminal&lt;/span&gt;
    &lt;/div&gt;
    &lt;div style="padding: 16px; font-family: monospace; font-size: 13px; line-height: 1.6;"&gt;
      &lt;p style="color: #6b7280; margin: 4px 0;"&gt;# Process an audio file, wait for completion, and download the result&lt;/p&gt;
      &lt;p style="margin: 4px 0;"&gt;&lt;span style="color: #4ade80;"&gt;$ &lt;/span&gt;&lt;span style="color: #f3f4f6;"&gt;auphonic process interview.wav --wait --download&lt;/span&gt;&lt;/p&gt;
      &lt;p style="color: #6b7280; margin: 4px 0;"&gt;Processing interview.wav...&lt;/p&gt;
      &lt;p style="color: #6b7280; margin: 4px 0;"&gt;Status: Done&lt;/p&gt;
      &lt;p style="color: #6b7280; margin: 4px 0;"&gt;Downloaded: interview-auphonic.mp3&lt;/p&gt;
    &lt;/div&gt;
  &lt;/div&gt;

  &lt;p&gt;
    &lt;a href="https://auphonic.com/help/web/preset.html" target="_blank"&gt;Presets&lt;/a&gt;
    save your production settings &amp;mdash; output formats, loudness targets, metadata, publishing destinations &amp;mdash;
    so you get consistent results across episodes without repeating yourself.
    The &lt;span style="font-family: monospace; background: #f3f4f6; padding: 1px 5px; border-radius: 3px; font-size: 13px;"&gt;--preset&lt;/span&gt; flag accepts either the preset name or its UUID.
    You can also pipe audio from other tools like ffmpeg:
  &lt;/p&gt;

  &lt;div style="background: #111827; border-radius: 8px; overflow: hidden; margin: 1em auto; max-width: 680px;"&gt;
    &lt;div style="display: flex; align-items: center; gap: 6px; padding: 8px 16px; background: rgba(31, 41, 55, 0.6);"&gt;
      &lt;span style="width: 12px; height: 12px; border-radius: 50%; background: rgba(239, 68, 68, 0.8);"&gt;&lt;/span&gt;
      &lt;span style="width: 12px; height: 12px; border-radius: 50%; background: rgba(234, 179, 8, 0.8);"&gt;&lt;/span&gt;
      &lt;span style="width: 12px; height: 12px; border-radius: 50%; background: rgba(34, 197, 94, 0.8);"&gt;&lt;/span&gt;
      &lt;span style="margin-left: 8px; font-size: 12px; color: #6b7280; font-family: monospace;"&gt;Terminal&lt;/span&gt;
    &lt;/div&gt;
    &lt;div style="padding: 16px; font-family: monospace; font-size: 13px; line-height: 1.6;"&gt;
      &lt;p style="color: #6b7280; margin: 4px 0;"&gt;# Apply a preset to your production&lt;/p&gt;
      &lt;p style="margin: 4px 0;"&gt;&lt;span style="color: #4ade80;"&gt;$ &lt;/span&gt;&lt;span style="color: #f3f4f6;"&gt;auphonic process episode.wav --preset "My Podcast" --wait --download&lt;/span&gt;&lt;/p&gt;
      &lt;p style="margin: 4px 0;"&gt;&lt;/p&gt;
      &lt;p style="color: #6b7280; margin: 4px 0;"&gt;# Pipe audio from ffmpeg&lt;/p&gt;
      &lt;p style="margin: 4px 0;"&gt;&lt;span style="color: #4ade80;"&gt;$ &lt;/span&gt;&lt;span style="color: #f3f4f6;"&gt;ffmpeg -i recording.mkv -vn -f wav - | auphonic process - --preset "My Podcast"&lt;/span&gt;&lt;/p&gt;
    &lt;/div&gt;
  &lt;/div&gt;

  &lt;p&gt;
    Need to check on past productions or grab a result file?
    The &lt;span style="font-family: monospace; background: #f3f4f6; padding: 1px 5px; border-radius: 3px; font-size: 13px;"&gt;list&lt;/span&gt; and &lt;span style="font-family: monospace; background: #f3f4f6; padding: 1px 5px; border-radius: 3px; font-size: 13px;"&gt;download&lt;/span&gt; commands let you
    quickly browse your production history and pull output files.
    Add &lt;span style="font-family: monospace; background: #f3f4f6; padding: 1px 5px; border-radius: 3px; font-size: 13px;"&gt;--open&lt;/span&gt; to open the production in your browser,
    or use &lt;span style="font-family: monospace; background: #f3f4f6; padding: 1px 5px; border-radius: 3px; font-size: 13px;"&gt;auphonic open UUID&lt;/span&gt; to jump to any production at any time:
  &lt;/p&gt;

  &lt;div style="background: #111827; border-radius: 8px; overflow: hidden; margin: 1em auto; max-width: 680px;"&gt;
    &lt;div style="display: flex; align-items: center; gap: 6px; padding: 8px 16px; background: rgba(31, 41, 55, 0.6);"&gt;
      &lt;span style="width: 12px; height: 12px; border-radius: 50%; background: rgba(239, 68, 68, 0.8);"&gt;&lt;/span&gt;
      &lt;span style="width: 12px; height: 12px; border-radius: 50%; background: rgba(234, 179, 8, 0.8);"&gt;&lt;/span&gt;
      &lt;span style="width: 12px; height: 12px; border-radius: 50%; background: rgba(34, 197, 94, 0.8);"&gt;&lt;/span&gt;
      &lt;span style="margin-left: 8px; font-size: 12px; color: #6b7280; font-family: monospace;"&gt;Terminal&lt;/span&gt;
    &lt;/div&gt;
    &lt;div style="padding: 16px; font-family: monospace; font-size: 13px; line-height: 1.6;"&gt;
      &lt;p style="color: #6b7280; margin: 4px 0;"&gt;# List recent productions&lt;/p&gt;
      &lt;p style="margin: 4px 0;"&gt;&lt;span style="color: #4ade80;"&gt;$ &lt;/span&gt;&lt;span style="color: #f3f4f6;"&gt;auphonic list&lt;/span&gt;&lt;/p&gt;
      &lt;p style="color: #6b7280; margin: 4px 0;"&gt;UUID                    Status    Title&lt;/p&gt;
      &lt;p style="color: #6b7280; margin: 4px 0;"&gt;aB3xKmNpQrStUvWxYz1234  Done      Episode 42&lt;/p&gt;
      &lt;p style="color: #6b7280; margin: 4px 0;"&gt;zY9wLcFgHjKmNpQrSt5678  Done      Interview&lt;/p&gt;
      &lt;p style="margin: 4px 0;"&gt;&lt;/p&gt;
      &lt;p style="color: #6b7280; margin: 4px 0;"&gt;# Download a production's output files&lt;/p&gt;
      &lt;p style="margin: 4px 0;"&gt;&lt;span style="color: #4ade80;"&gt;$ &lt;/span&gt;&lt;span style="color: #f3f4f6;"&gt;auphonic download aB3xKmNpQrStUvWxYz1234&lt;/span&gt;&lt;/p&gt;
      &lt;p style="margin: 4px 0;"&gt;&lt;/p&gt;
      &lt;p style="color: #6b7280; margin: 4px 0;"&gt;# Open a production in the browser&lt;/p&gt;
      &lt;p style="margin: 4px 0;"&gt;&lt;span style="color: #4ade80;"&gt;$ &lt;/span&gt;&lt;span style="color: #f3f4f6;"&gt;auphonic open aB3xKmNpQrStUvWxYz1234&lt;/span&gt;&lt;/p&gt;
    &lt;/div&gt;
  &lt;/div&gt;

  &lt;p&gt;
    &lt;a href="https://auphonic.com/help/algorithms/multitrack.html" target="_blank"&gt;Multitrack&lt;/a&gt;
    productions let you process separate audio tracks for each speaker or source &amp;mdash;
    Auphonic automatically balances levels, reduces crosstalk, and mixes them into a final file.
    This is ideal for interviews, panel discussions, or any recording with multiple microphones:
  &lt;/p&gt;

  &lt;div style="background: #111827; border-radius: 8px; overflow: hidden; margin: 1em auto; max-width: 680px;"&gt;
    &lt;div style="display: flex; align-items: center; gap: 6px; padding: 8px 16px; background: rgba(31, 41, 55, 0.6);"&gt;
      &lt;span style="width: 12px; height: 12px; border-radius: 50%; background: rgba(239, 68, 68, 0.8);"&gt;&lt;/span&gt;
      &lt;span style="width: 12px; height: 12px; border-radius: 50%; background: rgba(234, 179, 8, 0.8);"&gt;&lt;/span&gt;
      &lt;span style="width: 12px; height: 12px; border-radius: 50%; background: rgba(34, 197, 94, 0.8);"&gt;&lt;/span&gt;
      &lt;span style="margin-left: 8px; font-size: 12px; color: #6b7280; font-family: monospace;"&gt;Terminal&lt;/span&gt;
    &lt;/div&gt;
    &lt;div style="padding: 16px; font-family: monospace; font-size: 13px; line-height: 1.6;"&gt;
      &lt;p style="color: #6b7280; margin: 4px 0;"&gt;# Multitrack production with labeled tracks&lt;/p&gt;
      &lt;p style="margin: 4px 0;"&gt;&lt;span style="color: #4ade80;"&gt;$ &lt;/span&gt;&lt;span style="color: #f3f4f6;"&gt;auphonic process \&lt;/span&gt;&lt;/p&gt;
      &lt;p style="margin: 4px 0;"&gt;&lt;span style="color: #f3f4f6;"&gt;&amp;nbsp;&amp;nbsp;--track id=host,file=host.wav \&lt;/span&gt;&lt;/p&gt;
      &lt;p style="margin: 4px 0;"&gt;&lt;span style="color: #f3f4f6;"&gt;&amp;nbsp;&amp;nbsp;--track id=guest,file=guest.wav \&lt;/span&gt;&lt;/p&gt;
      &lt;p style="margin: 4px 0;"&gt;&lt;span style="color: #f3f4f6;"&gt;&amp;nbsp;&amp;nbsp;--track id=music,file=music.wav \&lt;/span&gt;&lt;/p&gt;
      &lt;p style="margin: 4px 0;"&gt;&lt;span style="color: #f3f4f6;"&gt;&amp;nbsp;&amp;nbsp;--wait --download&lt;/span&gt;&lt;/p&gt;
    &lt;/div&gt;
  &lt;/div&gt;

  &lt;p&gt;
    You can add &lt;a href="https://auphonic.com/help/algorithms/speech_recognition.html" target="_blank"&gt;Speech Recognition&lt;/a&gt;
    to generate transcripts alongside your processed audio,
    publish directly to an
    &lt;a href="https://auphonic.com/help/web/services.html" target="_blank"&gt;External Service&lt;/a&gt;
    like your podcast host or YouTube,
    or set default options so you don't have to repeat them every time:
  &lt;/p&gt;

  &lt;div style="background: #111827; border-radius: 8px; overflow: hidden; margin: 1em auto; max-width: 680px;"&gt;
    &lt;div style="display: flex; align-items: center; gap: 6px; padding: 8px 16px; background: rgba(31, 41, 55, 0.6);"&gt;
      &lt;span style="width: 12px; height: 12px; border-radius: 50%; background: rgba(239, 68, 68, 0.8);"&gt;&lt;/span&gt;
      &lt;span style="width: 12px; height: 12px; border-radius: 50%; background: rgba(234, 179, 8, 0.8);"&gt;&lt;/span&gt;
      &lt;span style="width: 12px; height: 12px; border-radius: 50%; background: rgba(34, 197, 94, 0.8);"&gt;&lt;/span&gt;
      &lt;span style="margin-left: 8px; font-size: 12px; color: #6b7280; font-family: monospace;"&gt;Terminal&lt;/span&gt;
    &lt;/div&gt;
    &lt;div style="padding: 16px; font-family: monospace; font-size: 13px; line-height: 1.6;"&gt;
      &lt;p style="color: #6b7280; margin: 4px 0;"&gt;# Add speech recognition&lt;/p&gt;
      &lt;p style="margin: 4px 0;"&gt;&lt;span style="color: #4ade80;"&gt;$ &lt;/span&gt;&lt;span style="color: #f3f4f6;"&gt;auphonic process episode.wav --speech-recognition --wait&lt;/span&gt;&lt;/p&gt;
      &lt;p style="margin: 4px 0;"&gt;&lt;/p&gt;
      &lt;p style="color: #6b7280; margin: 4px 0;"&gt;# Publish to your podcast host&lt;/p&gt;
      &lt;p style="margin: 4px 0;"&gt;&lt;span style="color: #4ade80;"&gt;$ &lt;/span&gt;&lt;span style="color: #f3f4f6;"&gt;auphonic process episode.wav --publish SERVICE_UUID --wait&lt;/span&gt;&lt;/p&gt;
      &lt;p style="margin: 4px 0;"&gt;&lt;/p&gt;
      &lt;p style="color: #6b7280; margin: 4px 0;"&gt;# Set a default preset for all future productions&lt;/p&gt;
      &lt;p style="margin: 4px 0;"&gt;&lt;span style="color: #4ade80;"&gt;$ &lt;/span&gt;&lt;span style="color: #f3f4f6;"&gt;auphonic config set default-preset "My Podcast"&lt;/span&gt;&lt;/p&gt;
    &lt;/div&gt;
  &lt;/div&gt;

  &lt;p&gt;
    See the &lt;a href="https://auphonic.com/help/resources/cli.html#commands" target="_blank"&gt;CLI reference documentation&lt;/a&gt;
    for the full command reference.
  &lt;/p&gt;


  &lt;h3&gt;Auphonic API Updates&lt;/h3&gt;

  &lt;p&gt;
    Alongside the CLI, we've made some improvements to the
    &lt;a href="https://auphonic.com/help/api/" target="_blank"&gt;Auphonic API&lt;/a&gt;
    that benefit both API users and CLI users alike.
  &lt;/p&gt;

  &lt;p&gt;
    &lt;strong&gt;OpenAPI specification:&lt;/strong&gt;
    The Auphonic API now has a full
    &lt;a href="https://auphonic.com/help/openapi.yaml" target="_blank"&gt;OpenAPI 3.0.3 specification&lt;/a&gt;
    covering all endpoints.
    In practice, this means you can auto-generate client libraries in your language of choice,
    get autocompletion in your IDE, and validate requests before sending them.
    You can also browse the full API interactively via
    &lt;a href="https://auphonic.com/help/redoc.html" target="_blank"&gt;ReDoc&lt;/a&gt;.
  &lt;/p&gt;

  &lt;p&gt;
    &lt;strong&gt;Preset names in API:&lt;/strong&gt;
    You can now reference presets by name instead of UUID in API calls &amp;mdash;
    no more copying opaque identifiers.
    The CLI uses this feature under the hood, which is why &lt;span style="font-family: monospace; background: #f3f4f6; padding: 1px 5px; border-radius: 3px; font-size: 13px;"&gt;--preset "My Podcast"&lt;/span&gt;
    works out of the box.
    If multiple presets share a name, priority goes to personal presets first, then shared, then default.
    See the
    &lt;a href="https://auphonic.com/help/api/simple_api.html#start-a-production-and-upload-a-file" target="_blank"&gt;API documentation&lt;/a&gt;
    for details.
  &lt;/p&gt;


  &lt;h3&gt;Get Started&lt;/h3&gt;

  &lt;p&gt;
    The CLI runs on macOS, Linux, and Windows.
    &lt;strong&gt;Install it with a single command:&lt;/strong&gt;
  &lt;/p&gt;

  &lt;div style="background: #111827; border-radius: 8px; overflow: hidden; margin: 1em auto; max-width: 680px;"&gt;
    &lt;div style="display: flex; align-items: center; gap: 6px; padding: 8px 16px; background: rgba(31, 41, 55, 0.6);"&gt;
      &lt;span style="width: 12px; height: 12px; border-radius: 50%; background: rgba(239, 68, 68, 0.8);"&gt;&lt;/span&gt;
      &lt;span style="width: 12px; height: 12px; border-radius: 50%; background: rgba(234, 179, 8, 0.8);"&gt;&lt;/span&gt;
      &lt;span style="width: 12px; height: 12px; border-radius: 50%; background: rgba(34, 197, 94, 0.8);"&gt;&lt;/span&gt;
      &lt;span style="margin-left: 8px; font-size: 12px; color: #6b7280; font-family: monospace;"&gt;Terminal&lt;/span&gt;
    &lt;/div&gt;
    &lt;div style="padding: 16px; font-family: monospace; font-size: 13px; line-height: 1.6;"&gt;
      &lt;p style="margin: 4px 0;"&gt;&lt;span style="color: #4ade80;"&gt;$ &lt;/span&gt;&lt;span style="color: #f3f4f6;"&gt;curl -sSL https://auphonic.com/cli/install.sh | sh&lt;/span&gt;&lt;/p&gt;
      &lt;p style="margin: 4px 0;"&gt;&lt;/p&gt;
      &lt;p style="color: #6b7280; margin: 4px 0;"&gt;# Authenticate with your Auphonic account&lt;/p&gt;
      &lt;p style="margin: 4px 0;"&gt;&lt;span style="color: #4ade80;"&gt;$ &lt;/span&gt;&lt;span style="color: #f3f4f6;"&gt;auphonic auth login&lt;/span&gt;&lt;/p&gt;
    &lt;/div&gt;
  &lt;/div&gt;

  &lt;p&gt;
    You can also download pre-built binaries directly from
    &lt;a href="https://auphonic.com/cli" target="_blank"&gt;auphonic.com/cli&lt;/a&gt;,
    where you'll find setup instructions for all platforms. &lt;br&gt;
    For the full command reference, see the
    &lt;a href="https://auphonic.com/help/resources/cli.html" target="_blank"&gt;CLI documentation&lt;/a&gt;.
  &lt;/p&gt;


  &lt;h3&gt;Feedback&lt;/h3&gt;

  &lt;p&gt;
    The CLI is a new addition to Auphonic, and we'd love to hear how it works for you.
    If you have feedback, run into issues, or have ideas for new features,
    please visit our
    &lt;a href="https://auphonic.com/contact" target="_blank"&gt;Contact Page&lt;/a&gt;
    or email us at
    &lt;a href="mailto:support@auphonic.com"&gt;support@auphonic.com&lt;/a&gt;.
  &lt;/p&gt;

  &lt;br&gt;
  &lt;br&gt;
  &lt;br&gt;
</description><author>grh@auphonic.com (Georg)</author><pubDate>Thu, 26 Mar 2026 06:55:42 +0000</pubDate><guid>https://auphonic.com/blog/2026/03/26/auphonic-cli/</guid><enclosure length="100000" type="image/png" url="http://auphonic.com/help/_images/cli-terminal.png"/><category>Development</category><category>News</category></item><item><title>New Auphonic Static and Music Denoiser
</title><link>https://auphonic.com/blog/2025/07/31/new-auphonic-static-and-music-denoiser/</link><description>
&lt;p&gt;
    &lt;strong&gt;Technology is evolving fast&lt;/strong&gt; and we’re the ones pushing it forward.&lt;p&gt;
      Our new &lt;strong&gt;Static Denoiser&lt;/strong&gt; removes steady background noise like 
      &lt;strong&gt;hiss, hum, or fan noise&lt;/strong&gt; while keeping &lt;strong&gt;music, ambience, and sound design&lt;/strong&gt;
      fully intact. 
      Perfect for &lt;strong&gt;audio dramas, videos, music, meditations, podcasts&lt;/strong&gt;, or anything where 
      &lt;strong&gt;clarity matters&lt;/strong&gt; &lt;em&gt;and&lt;/em&gt; the &lt;strong&gt;atmosphere does too&lt;/strong&gt;.
  &lt;/p&gt;
      

&lt;img src="https://auphonic.com/media/blog/StaticNoiseReduction.jpg" style="width: 700px;"&gt;

&lt;h3&gt;Auphonic’s Vision for Noise Reduction&lt;/h3&gt;
  
  &lt;h4&gt;We firmly believe that there is no “one button to fix it all.”&lt;br&gt;&lt;/h4&gt; 
  But how can we say that when it is our mission to build exactly that?

  &lt;p&gt;&lt;strong&gt;Our goal is precision:&lt;/strong&gt; Giving users full control over what stays and what goes.&lt;/p&gt;

  &lt;p&gt;That’s why we offer &lt;b&gt;different tools for different kinds of noise&lt;/b&gt;:&lt;/p&gt;

  &lt;ul&gt;
    &lt;li&gt;&lt;a href="https://auphonic.com/help/algorithms/singletrack.html#speech-isolation" target="_blank"&gt;
      &lt;strong&gt;Speech Isolation&lt;/strong&gt;&lt;/a&gt; removes everything but your voice. 
      It keeps only the speech you care about.
    &lt;/li&gt;
    &lt;li&gt;&lt;a href="https://auphonic.com/help/algorithms/singletrack.html#dynamic-denoiser" target="_blank"&gt;
      &lt;strong&gt;Dynamic Denoiser&lt;/strong&gt;&lt;/a&gt; adapts in real time to changing environments 
        - ideal for unpredictable noise patterns, while leaving music intact.
    &lt;/li&gt;
    &lt;li&gt;&lt;a href="https://auphonic.com/help/algorithms/singletrack.html#static-denoiser" target="_blank"&gt;
          &lt;strong&gt;Static Denoiser&lt;/strong&gt;&lt;/a&gt;
       now updated to precisely target stationary noise
        - like constant hiss or hum - while preserving music, ambient effects, and subtle details.
    &lt;/li&gt;
  &lt;/ul&gt;

  &lt;p&gt;Think of audio dramas where &lt;b&gt;sound effects&lt;/b&gt; should &lt;b&gt;stay untouched&lt;/b&gt;, 
    or meditation recordings with &lt;b&gt;soft tonal elements&lt;/b&gt; that must &lt;b&gt;remain intact&lt;/b&gt;. 
    Unlike non-stationary noises (e.g. coughs, chair squeaks or mouth clicks), stationary noise is 
    consistent over time - making it ideal for &lt;b&gt;static removal without harming your content&lt;/b&gt;.&lt;/p&gt;

  &lt;p&gt;While we believe in precise user control, there is a way to reduce your audio editing work 
    to almost one button: &lt;strong&gt;Saving your favorite settings as 
    &lt;a href="https://auphonic.com/engine/presets/ " target="_blank"&gt;Custom Presets&lt;/a&gt;&lt;/strong&gt; 
    and applying those to your productions. 
    You can even &lt;b&gt;&lt;a href="https://mastodon.social/@auphonic/114896774982464587" target="_blank"&gt;share your Presets&lt;/a&gt; with friends!&lt;/b&gt;&lt;br&gt; 
    But more on that later 😉&lt;/p&gt;

  &lt;h3&gt;How to Use the New Static Denoiser&lt;/h3&gt;
    &lt;p&gt;Cleaning up your audio is easier than ever:
      Just upload your file, choose the &lt;strong&gt;Static Denoiser&lt;/strong&gt; 
      in the &lt;a href="https://auphonic.com/engine/upload/ " target="_blank"&gt;&lt;strong&gt;Production Form&lt;/strong&gt;&lt;/a&gt;, 
      and let Auphonic do the rest. No deep tech knowledge required.&lt;/p&gt;

    &lt;p&gt;Fine-tune the sound by adjusting the &lt;strong&gt;Remove Noise&lt;/strong&gt; and &lt;strong&gt;Remove Reverb&lt;/strong&gt; sliders.
      Lower them slightly to retain some natural texture while still boosting clarity and speech intelligibility.&lt;/p&gt;

      &lt;p style="text-align: center;"&gt;
        &lt;img src="https://auphonic.com/media/blog/Screenshot_NewStaticDenoiser_dropdown.png" /&gt;
      &lt;/p&gt;


    &lt;p&gt;And for everyone still wanting to use the Original Static Denoiser:
      Don't worry, the &lt;strong&gt;Legacy Denoising Version is still available&lt;/strong&gt;. 
      We renamed it to &lt;b&gt;"Classic"&lt;/b&gt; and you can use it as normal. Just select it through the drop-down menu.&lt;/p&gt;



  &lt;h3&gt;Audio Examples&lt;/h3&gt;
    
    &lt;p&gt;It’s hard to &lt;em&gt;read&lt;/em&gt; audio, but easy to hear the difference.
      Here is a &lt;b&gt;sound comparison of our denoising models&lt;/b&gt; - please use headphones to hear all details!
    &lt;/p&gt;

&lt;h4&gt;New &lt;a href="https://auphonic.com/help/algorithms/singletrack.html#static-denoiser" target="_blank"&gt;
  &lt;strong&gt;Static Denoiser&lt;/strong&gt;&lt;/a&gt; vs. 
  &lt;a href="https://auphonic.com/help/algorithms/singletrack.html#speech-isolation" target="_blank"&gt;
  &lt;strong&gt;Speech Isolation&lt;/strong&gt;&lt;/a&gt;
&lt;/h4&gt;

&lt;p&gt;We processed a snippet from the 
  &lt;a href="https://www.listennotes.com/e/9642e85189e342fa90500d2b25a0d1de/" target="_blank"&gt;History of Jazz&lt;/a&gt; 
  podcast using both &lt;b&gt;Speech Isolation&lt;/b&gt; and the &lt;b&gt;new Static Denoiser&lt;/b&gt;. 
  Speech Isolation removes everything but the speech - including music, background vocals, and ambience - 
  resulting in a clean, voice-only track. In contrast, the Static Denoiser keeps the musical texture 
  intact while removing just the steady background noise. Hear the difference for yourself!&lt;/p&gt;

&lt;table style="text-align: center; width: 100%; color: gray; font-size: small;"&gt;
  &lt;tr&gt;
    &lt;td style="padding: 0; vertical-align: middle;"&gt;Original:&lt;/td&gt;
    &lt;td style="width: 90%; padding: 0; vertical-align: middle;"&gt;
      &lt;audio controls style="width: 100%;"&gt;&lt;source src="https://auphonic.com/media/blog/HistoryOfJazzPodcast20LUFS_153drum13s.flac" type="audio/flac"&gt;&lt;/audio&gt;
    &lt;/td&gt;
  &lt;/tr&gt;  
  &lt;tr&gt;
    &lt;td style="padding: 0; vertical-align: middle;"&gt;Static Denoised:&lt;/td&gt;
    &lt;td style="width: 90%; padding: 0; vertical-align: middle;"&gt;
      &lt;audio controls style="width: 100%;"&gt;&lt;source src="https://auphonic.com/media/blog/HistoryOfJazzPodcast20LUFS_153drum13s_ST100NR_BWE20LUFS.flac" type="audio/flac"&gt;&lt;/audio&gt;
    &lt;/td&gt;
  &lt;/tr&gt;
  &lt;tr&gt;
    &lt;td style="padding: 0; vertical-align: middle;"&gt;Speech Isolated:&lt;/td&gt;
    &lt;td style="width: 90%; padding: 0; vertical-align: middle;"&gt;
      &lt;audio controls style="width: 100%;"&gt;&lt;source src="https://auphonic.com/media/blog/HistoryOfJazzPodcast20LUFS_153drum13s_SI100NR_BWE20LUFS.flac" type="audio/flac"&gt;&lt;/audio&gt;
    &lt;/td&gt;
  &lt;/tr&gt;
&lt;/table&gt;

&lt;h4&gt;New &lt;a href="https://auphonic.com/help/algorithms/singletrack.html#static-denoiser" target="_blank"&gt;
  &lt;strong&gt;Static Denoiser&lt;/strong&gt;&lt;/a&gt; vs.
  &lt;a href="https://auphonic.com/help/algorithms/singletrack.html#dynamic-denoiser" target="_blank"&gt;
  &lt;strong&gt;Dynamic Denoiser&lt;/strong&gt;&lt;/a&gt;
&lt;/h4&gt;

&lt;p&gt;
  In this excerpt from the German audio drama &lt;a href="https://youtu.be/TqfYU8uiwys?si=2rvsunkvSum2uLtA&amp;t=725" target="_blank"&gt;Der Graue&lt;/a&gt;,
  the &lt;b&gt;&lt;em&gt;&amp;#8220Static Denoiser&amp;#8221&lt;/em&gt; perfectly preserves all the sound effects&lt;/b&gt; while removing the reverb and static noise,
  whereas the &lt;em&gt;&amp;#8220Dynamic Denoiser&amp;#8221&lt;/em&gt; (or &lt;em&gt;&amp;#8220Speech Isolation&amp;#8221&lt;/em&gt;) removes everything from the audio that is not speech:
&lt;/p&gt;
&lt;table style="text-align: center; width: 100%; color: gray; font-size: small;"&gt;
  &lt;tr&gt;
    &lt;td style="padding: 0; vertical-align: middle;"&gt;Original:&lt;/td&gt;
    &lt;td style="width: 90%; padding: 0; vertical-align: middle;"&gt;
      &lt;audio controls style="width: 100%;"&gt;&lt;source src="https://auphonic.com/media/blog/DerGraue1l79-130_20LUFS_cut.flac" type="audio/flac"&gt;&lt;/audio&gt;
    &lt;/td&gt;
  &lt;/tr&gt;  
  &lt;tr&gt;
    &lt;td style="padding: 0; vertical-align: middle;"&gt;Static Denoised:&lt;/td&gt;
    &lt;td style="width: 90%; padding: 0; vertical-align: middle;"&gt;
      &lt;audio controls style="width: 100%;"&gt;&lt;source src="https://auphonic.com/media/blog/DerGraue1l79-130_20LUFS_cut_ST100EQ20LUFS.flac" type="audio/flac"&gt;&lt;/audio&gt;
    &lt;/td&gt;
  &lt;/tr&gt;
  &lt;tr&gt;
    &lt;td style="padding: 0; vertical-align: middle;"&gt;Dynamic Denoised:&lt;/td&gt;
    &lt;td style="width: 90%; padding: 0; vertical-align: middle;"&gt;
      &lt;audio controls style="width: 100%;"&gt;&lt;source src="https://auphonic.com/media/blog/DerGraue1l79-130_20LUFS_cut_DY100EQ20LUFS.flac" type="audio/flac"&gt;&lt;/audio&gt;
    &lt;/td&gt;
  &lt;/tr&gt;
&lt;/table&gt;

&lt;h4&gt;New &lt;a href="https://auphonic.com/help/algorithms/singletrack.html#static-denoiser" target="_blank"&gt;
  &lt;strong&gt;Static Denoiser&lt;/strong&gt;&lt;/a&gt; vs. Legacy Classic Denoiser&lt;/h4&gt;

  &lt;p&gt;To show how far we’ve come, we processed a short segment from the radio play 
    &lt;a href="https://youtu.be/2x9IezS3-Ac?si=Yh66Lw4aR2Rxuaot&amp;t=1304" target="_blank"&gt;Around the World in 80 Days&lt;/a&gt; 
    using both the &lt;b&gt;new Static Denoiser&lt;/b&gt; and the &lt;b&gt;legacy Classic Denoiser&lt;/b&gt;. &lt;br&gt;
    The Classic Denoiser extracts noise prints in speech pauses, which is not possible in this example because of the background music, whereas the new model removes noise cleanly.
  &lt;/p&gt;

&lt;table style="text-align: center; width: 100%; color: gray; font-size: small;"&gt;
  &lt;tr&gt;
    &lt;td style="padding: 0; vertical-align: middle;"&gt;Original:&lt;/td&gt;
    &lt;td style="width: 90%; padding: 0; vertical-align: middle;"&gt;
      &lt;audio controls style="width: 100%;"&gt;&lt;source src="https://auphonic.com/media/blog/JulesVerne-AroundTheWorldIn80daysOrig278L10s_20LUFS.flac" type="audio/flac"&gt;&lt;/audio&gt;
    &lt;/td&gt;
  &lt;/tr&gt;  
  &lt;tr&gt;
    &lt;td style="padding: 0; vertical-align: middle;"&gt;Old Classic Denoised:&lt;/td&gt;
    &lt;td style="width: 90%; padding: 0; vertical-align: middle;"&gt;
      &lt;audio controls style="width: 100%;"&gt;&lt;source src="https://auphonic.com/media/blog/JulesVerne-AroundTheWorldIn80daysOrig278L10s_20LUFS_CL100EQ20LUFS.flac" type="audio/flac"&gt;&lt;/audio&gt;
    &lt;/td&gt;
  &lt;/tr&gt;
  &lt;tr&gt;
    &lt;td style="padding: 0; vertical-align: middle;"&gt;New Static Denoised:&lt;/td&gt;
    &lt;td style="width: 90%; padding: 0; vertical-align: middle;"&gt;
      &lt;audio controls style="width: 100%;"&gt;&lt;source src="https://auphonic.com/media/blog/JulesVerne-AroundTheWorldIn80daysOrig278L10s_20LUFS_ST100EQ20LUFS.flac" type="audio/flac"&gt;&lt;/audio&gt;
    &lt;/td&gt;
  &lt;/tr&gt;
&lt;/table&gt;




&lt;h3&gt;Try It Out&lt;/h3&gt;
  &lt;p&gt;&lt;b&gt;The new  Static Denoiser is live!&lt;/b&gt; Perfect for cleaning up hiss and hum while 
    keeping music and sound effects intact.&lt;/p&gt;

  &lt;p&gt;If you're used to the legacy model and want to keep using it: No worries, 
    the &lt;b&gt;Classic Denoiser is still available&lt;/b&gt;. Want more aggressive cleanup or even full music removal? 
    &lt;a href="https://auphonic.com/" target="_blank"&gt;
      &lt;strong&gt;Try our Dynamic Denoiser or Speech Isolation models&lt;/strong&gt;
    &lt;/a&gt;.
  &lt;/p&gt;

  &lt;p&gt;&lt;strong&gt;Re-processing is free&lt;/strong&gt; - so go wild:&lt;br&gt;
  As long as you don’t change your input file, you can tweak your production and &lt;b&gt;test different 
    settings without using extra credits&lt;/b&gt;. Try it. Break it. Compare models. Save your 
    findings as your favorite preset. And please, tell us what you think!&lt;/p&gt;

&lt;h3&gt;Feedback&lt;/h3&gt;
  &lt;p&gt;We drive innovation through your feedback and would love to hear how the new Static Denoiser works for you.
    &lt;b&gt;Send us your thoughts&lt;/b&gt; through the production feedback form or reach out through our 
    &lt;a href="https://auphonic.com/contact" target="_blank"&gt;&lt;strong&gt;Contact Page&lt;/strong&gt;&lt;/a&gt;.
  &lt;/p&gt; 
  
  &lt;p&gt;Every bit of input helps us fine-tune things further.&lt;/p&gt;

&lt;br&gt;
&lt;br&gt;
&lt;br&gt;
</description><author>lukasm@auphonic.com (Lukas)</author><pubDate>Thu, 31 Jul 2025 09:43:32 +0000</pubDate><guid>https://auphonic.com/blog/2025/07/31/new-auphonic-static-and-music-denoiser/</guid><enclosure length="100000" type="image/jpeg" url="http://auphonic.com/media/blog/StaticNoiseReduction.jpg"/><category>Audio</category><category>Development</category><category>News</category></item><item><title>Independently control Noise, Reverb and Breath Reduction Amounts
</title><link>https://auphonic.com/blog/2024/05/16/independently-control-noise-reverb-and-breath-reduction-amounts/</link><description>
&lt;p&gt;
    Responding to your feedback, we are now proud to present new &lt;b&gt;separate parameters for noise, reverb, and breath reduction&lt;/b&gt; to give you &lt;b&gt;more flexible control&lt;/b&gt; for your individual, best output results. &lt;br&gt;
    Find all the &lt;a href="#parameters"&gt;new parameters&lt;/a&gt; below and listen to the &lt;a href="#examples"&gt;Audio Examples&lt;/a&gt; to get a closer impression of the upgrade.
  &lt;/p&gt;

  &lt;img src="https://auphonic.com/media/blog/multiple-reduction-amounts.jpg" width="700"/&gt;
  &lt;br&gt;


  &lt;h3&gt;What's the update about?&lt;/h3&gt;

  &lt;h4&gt;Before&lt;/h4&gt;
  &lt;p&gt;
    Previously, you could only set the &lt;a href="https://auphonic.com/help/algorithms/singletrack.html#noise-and-reverb-reduction"&gt;Denoising Method&lt;/a&gt; and &lt;b&gt;one reduction amount, that was used for all elements&lt;/b&gt;. &lt;br&gt;
    Depending on the selected method, you were already able to decide whether music, static, or changing noises should be removed, but &lt;b&gt;there was no setting to keep the typewriter sound effects while removing the reverb&lt;/b&gt;, for example. &lt;br&gt;
    &lt;!-- Our frequent users already know the concept of our three &lt;a href="https://auphonic.com/help/algorithms/singletrack.html#noise-and-reverb-reduction"&gt;Denoising Methods&lt;/a&gt;: The &lt;b&gt;Static Denoiser&lt;/b&gt; only removes stationary noise, the &lt;b&gt;Dynamic Denoiser&lt;/b&gt; keeps speech and music while removing noise and reverb (plus 'Remove Breathings' option) and the &lt;b&gt;Speech Isolation&lt;/b&gt; method keeps only speech while removing noise, reverb and music (plus 'Remove Breathings' option). &lt;br&gt; --&gt;
  &lt;/p&gt;

  &lt;h4&gt;Now&lt;/h4&gt;
  &lt;p&gt;
    With our latest upgrade, you can &lt;b&gt;now set the reduction amounts separately for noise, reverb, and breathing sounds&lt;/b&gt;. &lt;br&gt;
    &lt;!-- For example, you could &lt;b&gt;keep sound effects&lt;/b&gt; like steps and singing birds in an audio play, but &lt;b&gt;slightly reduce&lt;/b&gt; the level of all &lt;b&gt;breathings&lt;/b&gt; and &lt;b&gt;completely remove the reverb&lt;/b&gt;. --&gt;
    For example, you could completely remove the background noise while &lt;b&gt;reducing the reverb just a little&lt;/b&gt; to enhance speech 
    intelligibility but keep the atmosphere. Like we did in &lt;a href="#ex1"&gt;Audio Example 1&lt;/a&gt;. &lt;br&gt;
    Many of you have also asked about the possibility of &lt;b&gt;slightly reducing breath sounds&lt;/b&gt; rather than eliminating them completely. In &lt;a href="#ex2"&gt;Audio Example 2&lt;/a&gt; we demonstrate how you can &lt;b&gt;prevent your audio from sounding strange and unnatural&lt;/b&gt; by reducing instead of eliminating all breathing sounds.
  &lt;/p&gt;
  &lt;p&gt;
    To all of you who are happy with the results and don't want anything to change, &lt;b&gt;relax&lt;/b&gt;: &lt;br&gt;
    If you don't change the &lt;b&gt;default settings&lt;/b&gt;, the noise reduction algorithms work &lt;b&gt;exactly the same as before&lt;/b&gt;.
  &lt;/p&gt;
  &lt;p&gt;
    &lt;b&gt;Note:&lt;/b&gt; As the 'Static Denoiser' removes only stationary noise, there are no 'Remove Reverb' and 'Remove Breathings' parameters available for this denoising method.  
  &lt;/p&gt;

  &lt;h3 id="parameters"&gt;New Parameters&lt;/h3&gt;
  &lt;p style="text-align: center; font-size: small; font-style: italic"&gt;
    &lt;img src="https://auphonic.com/media/blog/screenshot_denoise_deverb_amount.png" /&gt;
    Screenshot of the new &lt;a href="#parameters"&gt;Noise Reduction Parameters&lt;/a&gt; in the production form.
  &lt;/p&gt;
  &lt;p&gt;
    In order to use the new noise reduction features, you may &lt;b&gt;separately set the following parameters&lt;/b&gt;:
  &lt;/p&gt;
  &lt;p&gt;
    &lt;ul&gt;
      &lt;li&gt;&lt;b&gt;Denoising Method:&lt;/b&gt; (unchanged) Select what kind of noise you want to remove.&lt;br&gt;
        &lt;em&gt;[&lt;a href="https://auphonic.com/help/algorithms/singletrack.html#dynamic-denoiser"&gt;Dynamic Denoiser&lt;/a&gt; (default), 
        &lt;a href="https://auphonic.com/help/algorithms/singletrack.html#speech-isolation"&gt;Speech Isolation&lt;/a&gt;, 
        &lt;a href="https://auphonic.com/help/algorithms/singletrack.html#static-denoiser"&gt;Static Denoiser&lt;/a&gt;]&lt;/em&gt; &lt;br&gt;
        &lt;em&gt;Note that the parameters 'Remove Reverb' and 'Remove Breathings' are NOT available for Static Denoiser!&lt;/em&gt;&lt;/li&gt;
      &lt;li&gt;&lt;b&gt;Remove Noise:&lt;/b&gt; Select the amount of &lt;b&gt;noise&lt;/b&gt; you want to remove. &lt;br&gt;
        &lt;em&gt;[100 dB (default), Disable Denoise, 3 dB, 6 dB, ..., 100 dB (full)]&lt;/em&gt;&lt;/li&gt;
      &lt;li&gt;&lt;b&gt;Remove Reverb:&lt;/b&gt; Select the amount of &lt;b&gt;reverb&lt;/b&gt; you want to remove. &lt;br&gt;
        &lt;em&gt;[100 dB (default), Disable Deverb, 3 dB, 6 dB, ..., 100 dB (full)]&lt;/em&gt;&lt;/li&gt;
      &lt;li&gt;&lt;b&gt;Remove Breathings:&lt;/b&gt; Select the amount of breathings you want to remove. &lt;br&gt;
        &lt;em&gt;[Off (default), 3 dB, 6 dB, ..., 100 dB (full)]&lt;/em&gt;&lt;/li&gt;
    &lt;/ul&gt;
  &lt;/p&gt;
  &lt;p&gt;
    &lt;b&gt;Feel free to experiment with all the options to find your preferred parameter settings!&lt;/b&gt; Editing and reprocessing existing productions does not cost any additional credits as long as you don't change the input file.
  &lt;/p&gt;


  &lt;h2 id="examples"&gt;Listen to the results:&lt;/h2&gt;

  &lt;h4 id="ex1"&gt;1. Reverb reduction with full noise elimination&lt;/h4&gt;
    &lt;p&gt;
      For the first audio example by &lt;a href="https://www.conduitministries.com/" target="_blank"&gt;conduitministries.com&lt;/a&gt; we set the 'Remove Noise' amount to &lt;em&gt;100 dB (full)&lt;/em&gt; and varied the 'Remove Reverb' amount starting from &lt;em&gt;0 dB (Off)&lt;/em&gt; to &lt;em&gt;12 dB (medium)&lt;/em&gt; and to &lt;em&gt;100 dB (full)&lt;/em&gt;.
      Listen to how first the noise is gone and then step by step the reverb is lower:
    &lt;/p&gt;
    &lt;table style="text-align: right; width: 100%; color: gray; font-size: small;"&gt;
      &lt;tr&gt;
          &lt;td style="padding: 0; vertical-align: middle;"&gt;Original &lt;/td&gt;
          &lt;td style="width: 88%; padding: 5; vertical-align: middle;"&gt;
            &lt;audio controls style="width: 100%;"&gt;&lt;source src="https://auphonic.com/media/blog/ds585_reverb-18LUFS_cut.flac" type="audio/flac"&gt;&lt;/audio&gt;
          &lt;/td&gt;
      &lt;/tr&gt;
      &lt;tr&gt;
          &lt;td style="padding: 0; vertical-align: left;"&gt;-100dB Denoise &lt;br&gt;-0dB Deverb&amp;nbsp;&amp;nbsp; &lt;/td&gt;
          &lt;td style="width: 88%; padding: 5; vertical-align: middle;"&gt;
            &lt;audio controls style="width: 100%;"&gt;&lt;source src="https://auphonic.com/media/blog/ds585_reverb-18LUFS_cut_NB100.mp3" type="audio/flac"&gt;&lt;/audio&gt;
          &lt;/td&gt;
      &lt;/tr&gt;
      &lt;tr&gt;
          &lt;td style="padding: 0; vertical-align: left;"&gt;-100dB Denoise &lt;br&gt;-12dB Deverb&amp;nbsp;&amp;nbsp; &lt;/td&gt;
          &lt;td style="width: 88%; padding: 5; vertical-align: middle;"&gt;
            &lt;audio controls style="width: 100%;"&gt;&lt;source src="https://auphonic.com/media/blog/ds585_reverb-18LUFS_cut_NB100R012.mp3" type="audio/flac"&gt;&lt;/audio&gt;
          &lt;/td&gt;
      &lt;/tr&gt;
      &lt;tr&gt;
          &lt;td style="padding: 0; vertical-align: left;"&gt;-100dB Denoise &lt;br&gt;-100dB Deverb&amp;nbsp;&amp;nbsp; &lt;/td&gt;
          &lt;td style="width: 88%; padding: 5; vertical-align: middle;"&gt;
            &lt;audio controls style="width: 100%;"&gt;&lt;source src="https://auphonic.com/media/blog/ds585_reverb-18LUFS_cut_NBR100.mp3" type="audio/flac"&gt;&lt;/audio&gt;
          &lt;/td&gt;
      &lt;/tr&gt;
    &lt;/table&gt;
 
  &lt;h4 id="ex2"&gt;2. Breathing sound reduction&lt;/h4&gt;
    &lt;p&gt;
      In the breathing reduction audio example by &lt;a href="https://www.archive.org/download/cheesecurd_1209.poem_librivox/cheesecurd_mcintyre_kpm_64kb.mp3" target="_blank"&gt;LibriVox.org&lt;/a&gt; we used the 'Remove Breathing' amounts increasing from the original audio with &lt;em&gt;0 dB (Off)&lt;/em&gt; to &lt;em&gt;12 dB (medium)&lt;/em&gt; and to &lt;em&gt;100 dB (full)&lt;/em&gt; reduction. &lt;br&gt;
      In the result files you can hear, that the &lt;em&gt;100 dB (full)&lt;/em&gt; elimination leads to weird, unnatural-sounding pauses, that can be prevented by just reducing the breathing sounds:
    &lt;/p&gt;
    &lt;table style="text-align: right; width: 100%; color: gray; font-size: small;"&gt;
      &lt;tr&gt;
          &lt;td style="padding: 0; vertical-align: middle;"&gt;Original &lt;/td&gt;
          &lt;td style="width: 88%; padding: 5; vertical-align: middle;"&gt;
            &lt;audio controls style="width: 100%;"&gt;&lt;source src="https://auphonic.com/media/blog/cheesecurd-20LUFS_15s.flac" type="audio/flac"&gt;&lt;/audio&gt;
          &lt;/td&gt;
      &lt;/tr&gt;
      &lt;tr&gt;
          &lt;td style="padding: 0; vertical-align: left;"&gt;-12dB Debreath &lt;/td&gt;
          &lt;td style="width: 88%; padding: 5; vertical-align: middle;"&gt;
            &lt;audio controls style="width: 100%;"&gt;&lt;source src="https://auphonic.com/media/blog/cheesecurd_NR100B12only_15s.flac" type="audio/flac"&gt;&lt;/audio&gt;
          &lt;/td&gt;
      &lt;/tr&gt;
      &lt;tr&gt;
          &lt;td style="padding: 0; vertical-align: left;"&gt;-100dB Debreath &lt;/td&gt;
          &lt;td style="width: 88%; padding: 5; vertical-align: middle;"&gt;
            &lt;audio controls style="width: 100%;"&gt;&lt;source src="https://auphonic.com/media/blog/cheesecurd_NRB100only_15s.flac" type="audio/flac"&gt;&lt;/audio&gt;
          &lt;/td&gt;
      &lt;/tr&gt;
    &lt;/table&gt;


  &lt;p&gt;
    Try it now on
    &lt;a href="https://auphonic.com/" target="blank"&gt;auphonic.com&lt;/a&gt;!
  &lt;/p&gt;

  &lt;h2&gt;Feedback&lt;/h2&gt;
  &lt;p&gt;
    We hope you like our upgraded version of the &lt;a href="https://auphonic.com/help/algorithms/singletrack.html#noise-and-reverb-reduction"&gt;Noise Reduction Algorithms&lt;/a&gt; 
    with new parameters for more control. &lt;br&gt;
    If you have more feature requests or feedback for us, please
    &lt;a href="https://auphonic.com/contact" target="blank"&gt;let us know&lt;/a&gt;! You
    can also leave a comment in the feedback section on the status page of your
    specific production. We're looking forward to hearing from you!
  &lt;/p&gt;

  &lt;br /&gt;
  &lt;br /&gt;
  &lt;br /&gt;
</description><author>mpagavino@auphonic.com (Manuel)</author><pubDate>Thu, 16 May 2024 13:20:55 +0000</pubDate><guid>https://auphonic.com/blog/2024/05/16/independently-control-noise-reverb-and-breath-reduction-amounts/</guid><enclosure length="100000" type="image/jpeg" url="http://auphonic.com/media/blog/multiple-reduction-amounts.jpg"/><category>Audio</category><category>Development</category><category>News</category></item><item><title>New Auphonic Transcript Editor
</title><link>https://auphonic.com/blog/2024/03/21/new-auphonic-transcript-editor/</link><description>
&lt;p&gt;
    We're excited to roll out an
    &lt;b&gt;upgraded version of our Transcript Editor&lt;/b&gt;, focusing on enhancing your
    transcription workflow and making it more intuitive, especially for mobile
    users. This overhaul introduces several key improvements and features
    designed to streamline the transcription process.
  &lt;/p&gt;

  &lt;p style="text-align: center; font-weight: bold; padding-top: 1em;"&gt;
    &lt;a
    href="https://auphonic.com/media/audio-examples/speechrec/TVEyeOnMarvelS1E1_whisper.html"
    target="_blank"&gt;Click here for a Live Demo&lt;/a&gt;
  &lt;/p&gt;

  &lt;img src="https://auphonic.com/media/blog/transcript_editor_title_image.jpg" width="700" height="393.22" /&gt;

  &lt;h2&gt;What's new?&lt;/h2&gt;

  &lt;h3&gt;Line by Line Editing&lt;/h3&gt;
  &lt;p&gt;
    Your transcript is being &lt;b&gt;rendered line by line&lt;/b&gt;. This allows for
    precise editing of every single timestamp. Depending on the
    speech recognition engine editing can be done on word or phrase level.
    &lt;br /&gt;
    For optimal results, we suggest utilizing our
    &lt;a
      href="https://auphonic.com/help/algorithms/speech_recognition.html#auphonic-whisper-asr"
      target="_blank"
      &gt;Auphonic Whisper ASR&lt;/a
    &gt; engine.
  &lt;/p&gt;

  &lt;p style="text-align: center; font-size: small; font-style: italic"&gt;
    &lt;img src="https://auphonic.com/media/blog/transcript_editor_paragraph_with_9_lines.png" width="875" /&gt;
    A paragraph with 9 lines, every line represents a "subtitle line" (.vtt,
    .srt).
  &lt;/p&gt;

  &lt;p&gt;
    You can split or combine paragraphs and lines using the &lt;b&gt;Enter&lt;/b&gt; and
    &lt;b&gt;Backspace&lt;/b&gt; keys. Our new &lt;b&gt;Playback Slider&lt;/b&gt; enables seamless
    scrolling through the text, while we highlight the currently selected word
    as you go. With the switchable &lt;b&gt;Play on Click&lt;/b&gt; function you can start
    your playback from anywhere in the transcript.
  &lt;/p&gt;

  &lt;h3&gt;Automatic Shownotes and Chapters&lt;/h3&gt;
  &lt;p&gt;
    If you enable
    &lt;a
      href="https://auphonic.com/help/algorithms/speech_recognition.html#automatic-shownotes-and-chapters"
      target="_blank"
      &gt;Automatic Shownotes and Chapters&lt;/a
    &gt;
    in the Production form, we include AI generated shownotes and chapters
    directly into the Transcript Editor. You can
    &lt;b&gt;edit Chapter Times and Text&lt;/b&gt; directly within the Transcript Editor.
    Once you click &lt;b&gt;Save&lt;/b&gt; (top right), any modifications made within the
    shownotes and chapters will also be saved back to the production.
  &lt;/p&gt;

  &lt;p style="text-align: center; font-size: small; font-style: italic"&gt;
    &lt;img src="https://auphonic.com/media/blog/transcript_editor_automatic_shownotes.png" width="875" /&gt;
    Screenshot of Automatic Shownotes and Chapters withing the Transcript
    Editor.
  &lt;/p&gt;

  &lt;p&gt;
    You are also able to edit chapter times directly within the transcript
    editor. Please note that this only works within the
    &lt;b&gt;Transcript&lt;/b&gt; section of the editor to ensure precise placement of
    chapters.
  &lt;/p&gt;
  &lt;p style="text-align: center; font-size: small; font-style: italic"&gt;
    &lt;img src="https://auphonic.com/media/blog/transcript_editor_edit_chapter_time.png" width="500" /&gt;
    Screenshot of Edit Chapter Time.
  &lt;/p&gt;

  &lt;h3&gt;Local History: Undo and Redo&lt;/h3&gt;
  &lt;p&gt;
    Our Local History feature offers convenient
    &lt;b&gt;undo&lt;/b&gt; and &lt;b&gt;redo&lt;/b&gt; functionality. This means you can effortlessly
    revert changes or redo them as needed, providing you with greater control
    and flexibility during the editing process.
  &lt;/p&gt;

  &lt;h3&gt;Edit Speakers&lt;/h3&gt;
  &lt;p&gt;
    Our revamped Transcript Editor
    &lt;b&gt;automatically assigns speakers&lt;/b&gt; in
    &lt;a href="https://auphonic.com/help/web/multitrack.html"
      &gt;Multitrack Productions&lt;/a
    &gt;. You can use the &lt;b&gt;Track Identifier&lt;/b&gt; in our production form to assign
    speakers and easily edit, remove, or add new ones within the Transcript
    Editor. So it's clear who says what at any time!
  &lt;/p&gt;

  &lt;p style="text-align: center; font-size: small; font-style: italic"&gt;
    &lt;img src="https://auphonic.com/media/blog/transcript_editor_edit_speaker_name.png" width="875" /&gt;
    Screenshot of Edit Speaker.
  &lt;/p&gt;

  &lt;h3&gt;Confidence Highlighting&lt;/h3&gt;
  &lt;p&gt;
    Within the transcription areas of low confidence are highlighted with
    our &lt;b&gt;Highlight Low Confidence&lt;/b&gt; feature. So you can check up on the AI and
    edit words it is not entirely confident about! It also shows you how
    confident the AI is about the specific area if you hover over it.
  &lt;/p&gt;

  &lt;h3&gt;Offline Mode&lt;/h3&gt;
  &lt;p&gt;
    This feature enables you to
    &lt;b&gt;download and share the Transcript Editor&lt;/b&gt; for offline editing. This
    means you can share the *.html file with someone else for editing purposes.
    Simply use the &lt;b&gt;Download Editor&lt;/b&gt; action to obtain an offline version of
    the Transcript Editor.
  &lt;/p&gt;

  &lt;h3&gt;Export Transcript&lt;/h3&gt;
  &lt;p&gt;
    You can
    &lt;b&gt;export your transcript in all currently relevant formats&lt;/b&gt; (.srt, .vtt,
    .txt, .html, .pdf) and include them in your publishing process or video
    editing software.
  &lt;/p&gt;

  &lt;p style="text-align: center; font-size: small; font-style: italic"&gt;
    &lt;img src="https://auphonic.com/media/blog/transcript_editor_export.png" width="875" /&gt;
    Screenshot of Export Transcript.
  &lt;/p&gt;

  &lt;h3&gt;Responsive Design&lt;/h3&gt;
  &lt;p&gt;
    We want to make podcasting as easy as possible. The responsive design of our
    Transcript Editor allows for &lt;b&gt;fast editing on mobile devices&lt;/b&gt; - so you
    can edit what you're saying anywhere, anytime!
  &lt;/p&gt;

  &lt;p style="text-align: center; font-size: small; font-style: italic"&gt;
    &lt;img src="https://auphonic.com/media/blog/transcript_editor_responsive_ui.png" width="200" /&gt;
    Screenshot of Transcript Editor on a mobile device.
  &lt;/p&gt;

  &lt;h2&gt;Why use it?&lt;/h2&gt;
  &lt;p&gt;
    &lt;a
      href="https://podcasters.apple.com/support/5316-transcripts-on-apple-podcasts"
      target="_blank"
      &gt;Apple Podcasts&lt;/a
    &gt;
    paved the way for all podcasts to feature transcripts, so users can read
    through what you are saying and look for specific parts of your talk they're
    interested in revisiting. That could be book recommendations, advertisements
    or one of your ideas you want to share with the world.
  &lt;/p&gt;

  &lt;p&gt;
    We're trying to make the transcription process as automated and easy for you
    as possible. Especially when using
    &lt;a href="https://auphonic.com/help/web/multitrack.html"
      &gt;Multitrack Productions&lt;/a
    &gt;
    we strive to take all the work off of you and let the AI do its magic.
  &lt;/p&gt;

  &lt;p&gt;
    Additionally, we aim to simplify manual corrections wherever automated
    processes encounter challenges, ensuring that even these adjustments are as
    effortless as possible.
  &lt;/p&gt;

  &lt;p&gt;
    Try it now on
    &lt;a href="https://auphonic.com/" target="blank"&gt;auphonic.com&lt;/a&gt;!
  &lt;/p&gt;

  &lt;h2&gt;Feedback&lt;/h2&gt;
  &lt;p&gt;
    If you have feature requests or feedback for our new Transcript Editor, please
    &lt;a href="https://auphonic.com/contact" target="blank"&gt;let us know&lt;/a&gt;! You
    can also leave a comment in the feedback section on the status page of your
    specific production. &lt;br&gt;
    We're looking forward to hearing from you!
  &lt;/p&gt;

  &lt;br /&gt;
  &lt;br /&gt;
  &lt;br /&gt;
</description><author>manuelw@auphonic.com (Manuel)</author><pubDate>Thu, 21 Mar 2024 12:31:22 +0000</pubDate><guid>https://auphonic.com/blog/2024/03/21/new-auphonic-transcript-editor/</guid><enclosure length="100000" type="image/jpeg" url="http://auphonic.com/media/blog/transcript_editor_title_image.jpg"/><category>Development</category><category>News</category></item><item><title>Improve your Audio with our new Automatic Filler Word Cutter
</title><link>https://auphonic.com/blog/2023/10/04/new-automatic-filler-word-cutter/</link><description>
&lt;p&gt;
  We all know the problem: the content is perfectly prepared, and everything is in place, but the moment 
  you hit the record button, your brain freezes, and what pops out of your mouth is a &lt;b&gt;rain of 
  &lt;em&gt;&amp;#8220ums&amp;#8221&lt;/em&gt;, &lt;em&gt;&amp;#8220uhs&amp;#8221&lt;/em&gt;, and &lt;em&gt;&amp;#8220mhs&amp;#8221&lt;/em&gt; that no listener would enjoy&lt;/b&gt;. &lt;br&gt;
  Cleaning up a record like that by &lt;b&gt;manually cutting out every single filler word is a painstaking task&lt;/b&gt;.
&lt;/p&gt;
&lt;p&gt;
  So we heard your requests to &lt;b&gt;automate the filler word removing task&lt;/b&gt;, started implementing it, and are now very happy 
  to release our new &lt;b&gt;Automatic Filler Cutter&lt;/b&gt; feature.
  See our &lt;a href="#examples"&gt;Audio Examples&lt;/a&gt; and &lt;a href="#instructions"&gt;Usage Instructions&lt;/a&gt; below.
&lt;/p&gt;

&lt;img src="https://auphonic.com/media/blog/fotor-ai-20231002142827_undoing_mistakes.jpg" width="768" height="432" /&gt; 


&lt;h3&gt;What is removed?&lt;/h3&gt;
&lt;p&gt;
  While the definition of filler words is not the same, depending on who you ask, 
  some words can be used as filler as well as content. For example, 
  &lt;em&gt;&amp;#8220like&amp;#8221&lt;/em&gt;, &lt;em&gt;&amp;#8220well&amp;#8221&lt;/em&gt;, &lt;em&gt;&amp;#8220you know&amp;#8221&lt;/em&gt;, etc. cannot be removed without 
  the risk of removing also content and destroying sentences, even if those words are 
  used as filler words in some cases.
&lt;/p&gt;
&lt;p&gt;
  Therefore, we decided to focus on the removal of the obvious fillers, namely &lt;b&gt;any kind 
  of &lt;em&gt;&amp;#8220ums&amp;#8221&lt;/em&gt;, &lt;em&gt;&amp;#8220uhs&amp;#8221&lt;/em&gt;, &lt;em&gt;&amp;#8220mhs&amp;#8221&lt;/em&gt;, German &lt;em&gt;&amp;#8220ähm&amp;#8221&lt;/em&gt;, &lt;em&gt;&amp;#8220äh&amp;#8221&lt;/em&gt;, 
  &lt;em&gt;&amp;#8220öh&amp;#8221&lt;/em&gt;, French &lt;em&gt;&amp;#8220euh&amp;#8221&lt;/em&gt;, &lt;em&gt;&amp;#8220euhm&amp;#8221&lt;/em&gt; and similar&lt;/b&gt;.
&lt;/p&gt;



&lt;h3 id="examples"&gt;Audio Examples&lt;/h3&gt;

  &lt;h4&gt;1. English Male Speaker&lt;/h4&gt;
    &lt;p&gt;
      The first audio example is an excerpt from the interview
      &lt;a href="https://youtu.be/p-tfbDQxXhI" target="_blank"&gt;&amp;#8220From Racing Failure to Red Bull Champion: The Untold Christian Horner Story&amp;#8221&lt;/a&gt;.
      Our algorithm found and removed a remarkable &lt;b&gt;ten filler words&lt;/b&gt; in this 45-second snippet:
    &lt;/p&gt;
    &lt;p style="text-align: center; font-size: small; font-style: italic;"&gt; 
      &lt;!-- &lt;img src="Screenshot_ChristianHorner_45s_NOtimeline_NOcursor.png"/&gt; --&gt;
      &lt;img src="https://auphonic.com/media/blog/Screenshot_ChristianHorner_45s_NOtimeline.png" width="875"/&gt;
      &lt;!-- &lt;img src="Screenshot_ChristianHorner_45s_timeline.png"/&gt; --&gt;
      Screenshot of the &lt;a href="https://auphonic.com/help/web/audioinspector.html#web-audioinspector" target="_blank"&gt;
      Auphonic Audio Inspector&lt;/a&gt;: each pale red shaded area corresponds to a cut-out filler word.
    &lt;/p&gt;
    &lt;table style="text-align: center; width: 100%; color: gray; font-size: small;"&gt;
      &lt;tr&gt;
          &lt;td style="padding: 0; vertical-align: middle;"&gt;Original:&lt;/td&gt;
          &lt;td style="width: 90%; padding: 0; vertical-align: middle;"&gt;
            &lt;audio controls style="width: 100%;"&gt;&lt;source src="https://auphonic.com/media/blog/snippet1_REF_ChristianHornerStory_fillerNOcut_AllAlg.flac" type="audio/flac"&gt;&lt;/audio&gt;
          &lt;/td&gt;
      &lt;/tr&gt;
      &lt;tr&gt;
          &lt;td style="padding: 0; vertical-align: middle;"&gt;Cut:&lt;/td&gt;
          &lt;td style="width: 90%; padding: 0; vertical-align: middle;"&gt;
            &lt;audio controls style="width: 100%;"&gt;&lt;source src="https://auphonic.com/media/blog/snippet1cut_CUT_ChristianHornerStory_fillercut_AllAlg.flac" type="audio/flac"&gt;&lt;/audio&gt;
          &lt;/td&gt;
      &lt;/tr&gt;
    &lt;/table&gt;
 
  &lt;h4&gt;2. Austrian-German Female Speaker&lt;/h4&gt;
    &lt;p&gt;
      The following example is
      &lt;a href="https://www.podcast.de/episode/614697506/34dostojewski-wetter" target="_blank"&gt;an interview&lt;/a&gt; 
      with the Austrian Ex-Foreign Minister, Karin Kneissl, who uses &lt;b&gt;seven filler words&lt;/b&gt; within 26 seconds:
    &lt;/p&gt;
    &lt;p&gt; 
      &lt;!-- &lt;img id="screenshot" src="Screenshot_Kneissl_26s_NOtimeline_NOcursor.png"/&gt; --&gt;
      &lt;img id="screenshot" src="https://auphonic.com/media/blog/Screenshot_Kneissl_26s_NOtimeline.png" width="875"/&gt;
      &lt;!-- &lt;img src="Screenshot_Kneissl_26s_timeline.png"/&gt; --&gt;
    &lt;/p&gt; 
    &lt;table style="text-align: center; width: 100%; color: gray; font-size: small;"&gt;
      &lt;tr&gt;
          &lt;td style="padding: 0; vertical-align: middle;"&gt;Original:&lt;/td&gt;
          &lt;td style="width: 90%; padding: 0; vertical-align: middle;"&gt;
            &lt;audio controls style="width: 100%;"&gt;&lt;source src="https://auphonic.com/media/blog/snippet2_REF_unipolar-multipolar_fillerNOcut_AllAlgRef.flac" type="audio/flac"&gt;&lt;/audio&gt;
          &lt;/td&gt;
      &lt;/tr&gt;
      &lt;tr&gt;
          &lt;td style="padding: 0; vertical-align: middle;"&gt;Cut:&lt;/td&gt;
          &lt;td style="width: 90%; padding: 0; vertical-align: middle;"&gt;
            &lt;audio controls style="width: 100%;"&gt;&lt;source src="https://auphonic.com/media/blog/snippet2cut_CUT_unipolar-multipolar_fillercut_AllAlg.flac" type="audio/flac"&gt;&lt;/audio&gt;
          &lt;/td&gt;
      &lt;/tr&gt;
    &lt;/table&gt;


&lt;h3 id="instructions"&gt;Usage Instructions&lt;/h3&gt;
  &lt;p&gt;
    To use the Auphonic Automatic Filler Cutter feature, you just have to create a &lt;a href="https://auphonic.com/engine/" target="_blank"&gt;production&lt;/a&gt;
    or &lt;a href="https://auphonic.com/engine/presets/" target="_blank"&gt;preset&lt;/a&gt; as you are used to and select
    &lt;i&gt;&amp;#8220Cut Fillers&amp;#8221&lt;/i&gt; for &lt;i&gt;&amp;#8220Automatic Cutting&amp;#8221&lt;/i&gt; in the section &lt;i&gt;&amp;#8220Audio Algorithms&amp;#8221&lt;/i&gt;:
  &lt;/p&gt;
  &lt;img src="https://auphonic.com/media/blog/Screenshot_AutomaticFillerCutting_formBeta_width875.png" width="875"/&gt;
  &lt;p&gt;
    When your production is done, &lt;b&gt;all cut-out filler words will appear as pale red shaded areas&lt;/b&gt; in the 
    &lt;a href="https://auphonic.com/help/web/audioinspector.html#web-audioinspector" target="_blank"&gt;Auphonic Audio Inspector&lt;/a&gt; 
    on the production status page, as you can see in the upper &lt;a href="#screenshot"&gt;screenshot&lt;/a&gt; of the Audio Inspector.
  &lt;/p&gt; 
  &lt;!-- &lt;p&gt;
    You can also hide the cut regions by clicking the &lt;b&gt;[?] (show legend)&lt;/b&gt; button of the audio inspector 
    control bar and toggling the &lt;b&gt;Hide Cut Regions&lt;/b&gt; switch. With hidden cut segments, the timeline also 
    adjusts to the output file length, so the timestamps of the speech recognition transcript 
    (if generated) match the audio inspector again.
  &lt;/p&gt;  --&gt;
&lt;p&gt;
  If you want to &lt;b&gt;remove silent segments from your audio&lt;/b&gt; as well, please also enable our 
  &lt;a href="https://auphonic.com/blog/2023/05/09/automatic-silence-cutting/" target="_blank"&gt;
  Automatic Silence Cutting&lt;/a&gt; feature.
&lt;/p&gt;
  &lt;p&gt;
    &lt;b&gt;NOTE:&lt;/b&gt; Our Automatic Cutting features (for filler and silence) are not available for video files!
  &lt;/p&gt;



&lt;h3&gt;Behind the Scenes&lt;/h3&gt;
&lt;p&gt;
  For the training of our Automatic Filler Cutter &lt;a href="https://en.wikipedia.org/wiki/Artificial_intelligence" target="_blank"&gt;AI&lt;/a&gt;-Algorithm, 
  we created datasets that contain manually labeled audio files, collected from &lt;em&gt;'real world'&lt;/em&gt; audio data. 
  So far, we have &lt;b&gt;labeled, trained, and tested&lt;/b&gt; the system with &lt;b&gt;&lt;em&gt;English&lt;/em&gt;, &lt;em&gt;German&lt;/em&gt;, 
  &lt;em&gt;Spanish&lt;/em&gt;, and &lt;em&gt;French&lt;/em&gt;&lt;/b&gt; data.
&lt;/p&gt;
&lt;p&gt;
  However, in the &lt;a href="https://auphonic.com/" target="_blank"&gt;Auphonic Web Service&lt;/a&gt;, you can activate and &lt;b&gt;test the 
  Automatic Filler Word Cutter for all languages&lt;/b&gt;. We would be very happy to hear how the filler removal works out for 
  completely different-sounding languages from, e.g., the &lt;em&gt;Asian&lt;/em&gt;, &lt;em&gt;African&lt;/em&gt;, or &lt;em&gt;Slavic&lt;/em&gt; language families.  
&lt;/p&gt;
&lt;p&gt;
  &lt;b&gt;Please send us feedback on any problems or error patterns you discover!&lt;/b&gt; This will help us generate specific 
  data for the training to improve the algorithm and eliminate your problems.
&lt;/p&gt;


&lt;h3&gt;Conclusion&lt;/h3&gt;
  &lt;p&gt;
    Automatic filler word cutting is a powerful tool for podcasters looking to enhance the quality of their content. 
    It &lt;b&gt;boosts clarity and professionalism&lt;/b&gt;, all while making your editing process more efficient. 
    Some users, however, &lt;b&gt;see a touch of authenticity in filler words&lt;/b&gt; within podcasts. 
    So, we leave it up to you to &lt;b&gt;enable or disable the Automatic Filler Cutter&lt;/b&gt; feature for your 
    next Auphonic &lt;a href="https://auphonic.com/engine/" target="_blank"&gt;production&lt;/a&gt;, &lt;b&gt;depending on your desired style&lt;/b&gt;.
  &lt;/p&gt;
  &lt;p&gt;  
    We are currently working on filler word cutting optimizations for &lt;b&gt;more languages&lt;/b&gt;, 
    so &lt;a href="https://auphonic.com/contact" target="_blank"&gt;watch our channels&lt;/a&gt; to get all the news on our upgrades!
  &lt;/p&gt;
  &lt;p&gt;  
    If you have any &lt;b&gt;feedback&lt;/b&gt; for us &amp;ndash; &lt;b&gt;how the filler cutter is working in 
    your language&lt;/b&gt;, what you do or don't like, what you miss, what else you would want to 
    remove from your audio besides &lt;a href="https://auphonic.com/blog/2023/05/09/automatic-silence-cutting/" target="_blank"&gt;silence&lt;/a&gt; and filler words, etc. &amp;ndash; 
    you are welcome to &lt;a href="https://auphonic.com/contact" target="_blank"&gt;contact us&lt;/a&gt; 
    via email or directly comment on our production interface!
  &lt;/p&gt;


&lt;br&gt;
&lt;br&gt;
&lt;br&gt;
</description><author>isabell@auphonic.com (Isabell)</author><pubDate>Wed, 04 Oct 2023 08:44:33 +0000</pubDate><guid>https://auphonic.com/blog/2023/10/04/new-automatic-filler-word-cutter/</guid><enclosure length="100000" type="image/jpeg" url="http://auphonic.com/media/blog/fotor-ai-20231002142827_undoing_mistakes.jpg"/><category>Audio</category><category>Development</category><category>News</category></item><item><title>Automatically generate Shownotes, Summaries and Chapters from Recordings
</title><link>https://auphonic.com/blog/2023/06/12/auto-generate-shownotes-summaries-and-chapters-from-recordings/</link><description>
&lt;p&gt;
We're thrilled to introduce our Automatic Shownotes and
&lt;a href="https://auphonic.com/help/web/production.html#chapter-marks" target="_blank"&gt;Chapters&lt;/a&gt; feature.
This AI-powered tool effortlessly generates &lt;b&gt;concise summaries, intuitive chapter timestamps and relevant keywords&lt;/b&gt; for your podcasts, audio and video files.
&lt;br&gt;
See our &lt;a href="#example"&gt;Examples&lt;/a&gt; and the &lt;a href="#howto"&gt;How To&lt;/a&gt; section below for details.
&lt;/p&gt;

&lt;img src="https://auphonic.com/media/blog/auto-shownotes-cover.jpeg" width="630" height="420" /&gt; 


&lt;h3&gt;Why do I need Shownotes and Chapters?&lt;/h3&gt;
&lt;p&gt;
  In addition to links and other information, shownotes contain short summaries of the main topics of your episode, and inserted 
  &lt;a href="https://auphonic.com/help/web/production.html#chapter-marks" target="_blank"&gt;chapter marks&lt;/a&gt; allow you to timestamp sections with different topics of a podcast or video. 
  This makes your &lt;b&gt;content more accessible and user-friendly&lt;/b&gt;, enabeling listeners to quickly navigate to specific sections of the episode 
  or find a previous episode to brush up on a particular topic. &lt;br&gt;
&lt;/p&gt;
&lt;p&gt;
  Shownotes are also very likely to &lt;b&gt;boost your show's &lt;/b&gt;&lt;a href="https://en.wikipedia.org/wiki/Search_engine_optimization" target="_blank"&gt;Search Engine Optimization&lt;/a&gt; and eventually its popularity, &lt;b&gt;leading to an increase in listeners&lt;/b&gt;.
&lt;/p&gt;
&lt;p&gt;
  However, especially structuring the content and finding useful positions for &lt;a href="https://auphonic.com/help/web/production.html#chapter-marks" target="_blank"&gt;chapter marks&lt;/a&gt; is a very &lt;b&gt;time-consuming process, that can be fully automated&lt;/b&gt; with our new feature. &lt;br&gt;
&lt;/p&gt;
&lt;p&gt;
  Besides the obvious use of creating shownotes and chapters for podcasts, you can also use our new feature to easily generate an &lt;b&gt;abstract of your lecture&lt;/b&gt; recording, take the summary of your show as the &lt;b&gt;starting point for a social media post&lt;/b&gt;, or choose your favourite &lt;b&gt;chapter title as the podcast name&lt;/b&gt;.
&lt;/p&gt;


&lt;h3&gt;What happens behind the Scenes?&lt;/h3&gt;
&lt;p&gt;
  When the &lt;b&gt;Automatic Shownotes and Chapters&lt;/b&gt; feature is selected, the first step is speech transcription by either our internal &lt;a href="https://auphonic.com/help/algorithms/speech_recognition.html#auphonic-whisper-asr" target="_blank"&gt;Auphonic Whisper ASR&lt;/a&gt; or any integrated &lt;a href="https://auphonic.com/help/algorithms/speech_recognition.html#integrated-speech-recognition-services" target="_blank"&gt;External ASR Service&lt;/a&gt; of your choice.
&lt;/p&gt;
&lt;p&gt;
  Some open source tools and &lt;a href="https://chat.openai.com/" target="_blank"&gt;ChatGPT&lt;/a&gt; will then &lt;b&gt;summarize the ASR resulting text&lt;/b&gt; in different levels of detail, &lt;b&gt;analyze the content&lt;/b&gt; to identify sections with the different topics discussed, and finally complete each section with &lt;b&gt;timestamps for easy navigation&lt;/b&gt;. &lt;br&gt;
  Beginning with the generation of a &lt;em&gt;Long Summary&lt;/em&gt;, the number of characters is further reduced for a &lt;em&gt;Brief Summary&lt;/em&gt; and from the brief summary a &lt;em&gt;Subtitle&lt;/em&gt; and some &lt;em&gt;Keywords&lt;/em&gt; for the main topics are extracted. 
&lt;/p&gt;
&lt;p&gt;
  Depending on the duration of the input audio or video file, the level of detail of the thematic sections is also slightly adjusted, resulting in a &lt;b&gt;reasonable number of chapters&lt;/b&gt; for very short 5-minute audio files as well as for long 180-minute audio files.
&lt;/p&gt;


&lt;h3 id="howto"&gt;How to automatically generate Shownotes and Chapters in Auphonic&lt;/h3&gt;
&lt;p&gt;
  &lt;img src="https://auphonic.com/media/blog/SpeechRecognition_WhisperShownotes_2306.png" width="630"/&gt; 
  &lt;br&gt;
  If you are a &lt;em&gt;paying&lt;/em&gt; or &lt;em&gt;beta&lt;/em&gt; user, you can automatically generate shownotes and chapters &lt;b&gt;by checking the 
  &lt;em&gt;Automatic Shownotes and Chapters&lt;/em&gt; Checkbox&lt;/b&gt; in the Auphonic singletrack or multitrack &lt;a href="https://auphonic.com/engine/upload/" target="_blank"&gt;Production Form&lt;/a&gt; with any of our &lt;a href="https://auphonic.com/help/algorithms/speech_recognition.html" target="_blank"&gt;ASR Services&lt;/a&gt; enabled.  &lt;br&gt;
  Once your production is done, the generated data will show up in your transcript result files and in the well-known 
  &lt;a href="https://auphonic.com/help/algorithms/speech_recognition.html#auphonic-transcript-editor" target="_blank"&gt;Auphonic Transcript Editor&lt;/a&gt; 
  above the speech recognition transcript section. &lt;br&gt;
  By clicking on a chapter title in the &lt;em&gt;Chapters&lt;/em&gt; section of the transcript editor, you can jump directly to that chapter in your transcript to review and edit that section.
&lt;/p&gt;
&lt;p&gt;
  Unless you have manually entered content before, the generated data will also be &lt;b&gt;automatically stored in your audio files'&lt;/b&gt; &lt;a href="https://auphonic.com/help/web/production.html#basic-metadata" target="_blank"&gt;metadata&lt;/a&gt; as follows:  &lt;br&gt;
  &lt;ul&gt;
    &lt;li&gt;Generated &lt;em&gt;Long Summary&lt;/em&gt; stored in metadata field &lt;em&gt;Summary&lt;/em&gt;. &lt;/li&gt; 
    &lt;li&gt;Generated &lt;em&gt;Subtitle&lt;/em&gt; stored in metadata field &lt;em&gt;Subtitle&lt;/em&gt;. &lt;/li&gt; 
    &lt;li&gt;Generated &lt;em&gt;Keywords&lt;/em&gt; stored in metadata field &lt;em&gt;Tags&lt;/em&gt;. &lt;/li&gt; 
    &lt;li&gt;Generated &lt;em&gt;Timestamps&lt;/em&gt; for thematic sections stored as &lt;em&gt;Start Time&lt;/em&gt; of Chapters Marks.&lt;/li&gt;
    &lt;li&gt;Generated &lt;em&gt;Headlines&lt;/em&gt; for thematic sections stored as &lt;em&gt;Chapter Title&lt;/em&gt; of Chapters Marks.&lt;/li&gt;
  &lt;/ul&gt;
  The metadata is &lt;b&gt;automatically displayed&lt;/b&gt; with your audio file &lt;b&gt;wherever you import your audio for further editing&lt;/b&gt;. 
&lt;/p&gt;        
&lt;p&gt; 
  Please note that not all of our supported &lt;a href="https://auphonic.com/help/web/production.html#output-files" target="_blank"&gt;Output File Formats&lt;/a&gt; are designed to use metadata.
  &lt;br&gt;For details see our previous blog posts:
  &lt;a href="https://auphonic.com/blog/2012/01/17/podcast-comparison-part-2-mp3-metadata-id3-tags/"&gt;ID3 Tags Metadata&lt;/a&gt; (used in MP3 output files),
  &lt;a href="https://auphonic.com/blog/2012/01/22/podcast-comparison-part-3-ogg-vorbis-metadata-vorbis-comment/"&gt;Vorbis Comment Metadata&lt;/a&gt;
  (used in FLAC, Opus and Ogg Vorbis output files) and
  &lt;a href="https://auphonic.com/blog/2012/02/18/podcast-comparison-part-4-mpeg-4-itunes-style-metadata-aac-audio-m4a-mp4/"&gt;MPEG-4 iTunes-style Metadata&lt;/a&gt;
  (used in AAC, M4A/M4B/MP4 and ALAC output files).
&lt;/p&gt;


&lt;!--h3&gt;Editing generated Data&lt;/h3&gt;
&lt;p&gt;
  If you are &lt;b&gt;unhappy with any detail&lt;/b&gt; in the generated keywords, summaries or chapters used as metadata, you can return to your production's status page and click &lt;em&gt;Edit Production&lt;/em&gt; under the feedback section. The previously generated metadata will now &lt;b&gt;show up in the production form&lt;/b&gt; (as &lt;a href="https://auphonic.com/help/web/production.html#extended-metadata" target="_blank"&gt;Extended Metadata&lt;/a&gt; and &lt;a href="https://auphonic.com/help/web/production.html#chapter-marks" target="_blank"&gt;Chapter Marks&lt;/a&gt;) &lt;b&gt;for you to edit&lt;/b&gt;. 
&lt;/p&gt;
&lt;p&gt;
  When you are finished editing, simply rerun the production to update your changes to your audio file's metadata.
  &lt;b&gt;Re-running the production will NOT cost you any additional credits&lt;/b&gt; on your Auphonic account if you only change the metadata and settings, but do &lt;b&gt;NOT change the inputfile!&lt;/b&gt; 
  Manually edited metadata would not be overwritten, however, we do recommend turning off &lt;em&gt;Speech Recognition&lt;/em&gt; and &lt;em&gt;Shownote Generation&lt;/em&gt; for the second run to save time and avoid additional costs for external ASR services. &lt;br&gt;
&lt;/p&gt;
&lt;p&gt;
  Of course, you can also use any third-party software of your choice to edit the metadata after downloading.&lt;br&gt;
&lt;/p--&gt;


&lt;h3 id=example&gt;Example&lt;/h3&gt;
&lt;p&gt;
    As a real-life example, we &lt;b&gt;automatically generated shownotes and chapters&lt;/b&gt; for the &lt;em&gt;Lex Fridman Podcast #367:&lt;/em&gt;  &lt;a href="https://youtu.be/L_Guz73e6fw" target="_blank"&gt;"Sam Altman: OpenAI CEO on GPT-4, ChatGPT, and the Future of AI"&lt;/a&gt;.&lt;br&gt;
&lt;/p&gt;
&lt;p style="text-align:center"&gt;
  &lt;b&gt;Check out our transcript and generated shownotes:&lt;/b&gt;&lt;br&gt;
  &lt;a href="https://auphonic.com/media/blog/LexFridmanPodcast367-transcript.html" target="_blank"&gt;LexFridmanPodcast367-transcript.html&lt;/a&gt;
&lt;/p&gt;
&lt;p align=“center“ style="text-align:center"&gt;
    &lt;iframe width="500" height="315" src="https://www.youtube.com/embed/L_Guz73e6fw"
            title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write;
            encrypted-media; gyroscope; picture-in-picture" allowfullscreen&gt;&lt;/iframe&gt;&lt;br&gt;
    &amp;#8194&lt;br&gt;
&lt;/p&gt;


&lt;h3&gt;Conclusion&lt;/h3&gt;
&lt;p&gt;
  The automatic generation of shownotes and chapters is &lt;b&gt;a huge time-saver for podcasters and video creators&lt;/b&gt;, 
  as it speeds up the tedious process of manually structuring and summarizing your content. &lt;br&gt;
&lt;/p&gt;
&lt;p&gt;
  For now it is available for all &lt;em&gt;paying&lt;/em&gt; or &lt;em&gt;beta&lt;/em&gt; users.
  If you would like to become a &lt;em&gt;beta&lt;/em&gt; user, or have any questions or feedback, please do not hesitate to &lt;a href="https://auphonic.com/contact" target="_blank"&gt;contact us&lt;/a&gt;! 
&lt;/p&gt;

&lt;br&gt;
&lt;br&gt;
&lt;br&gt;
</description><author>isabell@auphonic.com (Isabell)</author><pubDate>Mon, 12 Jun 2023 13:03:18 +0000</pubDate><guid>https://auphonic.com/blog/2023/06/12/auto-generate-shownotes-summaries-and-chapters-from-recordings/</guid><enclosure length="100000" type="image/jpeg" url="http://auphonic.com/media/blog/auto-shownotes-cover.jpeg"/><category>Development</category><category>News</category></item><item><title>New Auphonic AutoEQ Filtering (Beta)
</title><link>https://auphonic.com/blog/2023/01/24/autoeq-beta/</link><description>
&lt;style&gt;
    table, th, td{
      border: 1px solid black;
      border-collapse: collapse;
      border-color: #999;
      font-family: Helvetica,"Helvetica Neue",Arial,sans-serif;
      font-size: 11px; line-height: 22.5px; color: #555;
      text-align: center
    }
    td:nth-child(odd), th:nth-child(odd){
      background-color: #eee;
    }
&lt;/style&gt;

&lt;p&gt;
    In addition to our &lt;a href="https://auphonic.com/help/algorithms/singletrack.html#adaptive-leveler" target="_blank"&gt;Leveler&lt;/a&gt;,
    &lt;a href="https://auphonic.com/blog/2022/06/10/new-noise-reduction-algorithms-beta/" target="_blank"&gt;Denoiser&lt;/a&gt;,
    and &lt;a href="https://auphonic.com/help/algorithms/singletrack.html#adaptive-filtering" target="_blank"&gt;Adaptive &lt;i&gt;'Hi-Pass'&lt;/i&gt; Filter&lt;/a&gt;,
    we now release the &lt;b&gt;missing equalization feature&lt;/b&gt; with the new &lt;b&gt;Auphonic AutoEQ&lt;/b&gt;.
    &lt;br&gt;
    The AutoEQ automatically analyzes and optimizes the frequency spectrum of a voice recording,
    to remove sibilance (&lt;a href="https://en.wikipedia.org/wiki/De-essing" target="_blank"&gt;De-esser&lt;/a&gt;) and
    to create a clear, warm, and pleasant sound - listen to the &lt;a href="#examples"&gt;audio examples&lt;/a&gt; below to get an idea about what it does.
&lt;/p&gt;

&lt;p style="text-align: center; font-size: small;"&gt;
    &lt;a href="https://auphonic.com/media/blog/fabfilterProQ3_screenshot_big.jpg" target="_blank"&gt;
        &lt;img src="https://auphonic.com/media/blog/fabfilterProQ3_screenshot_small.jpg" width="630" height="382" /&gt;
    &lt;/a&gt;
    Screenshot of manually adjusted example settings for the equalizer plug-in &lt;i&gt;'Pro-Q3'&lt;/i&gt; by
    &lt;a href="https://www.fabfilter.com/products/pro-q-3-equalizer-plug-in" target="_blank"&gt;fabfilter&lt;/a&gt;.
&lt;/p&gt;


&lt;h3&gt;What is Equalization and why is it difficult?&lt;/h3&gt;
&lt;p&gt;
    &lt;b&gt;Equalization (EQ)&lt;/b&gt; in audio recording and reproduction is the process of &lt;b&gt;adjusting&lt;/b&gt; the volume of
    different &lt;b&gt;frequency bands&lt;/b&gt; within a signal.
    &lt;br&gt;
    The following vocal EQ cheat sheet, published by &lt;a href="https://shop.producerhive.com/products/vocal-eq-cheat-sheet" target="_blank"&gt;Producer Hive&lt;/a&gt;,
    will give you a small impression, of what can be influenced by equalizing:
&lt;/p&gt;

&lt;p style="text-align: center; font-size: small;"&gt;
    &lt;a href="https://auphonic.com/media/blog/ProducerHive_VocalEQCheatSheet.jpg" target="_blank"&gt;
        &lt;img src="https://auphonic.com/media/blog/ProducerHive_VocalEQCheatSheet_small.jpg" width="630" height="383"/&gt;
    &lt;/a&gt;
    &lt;i&gt;Vocal EQ Cheat Sheet&lt;/i&gt; by &lt;a href="https://shop.producerhive.com/products/vocal-eq-cheat-sheet" target="_blank"&gt;Producer Hive&lt;/a&gt;.
&lt;/p&gt;

&lt;p&gt;
    On the other hand, it is very &lt;b&gt;easy to ruin&lt;/b&gt; a good voice recording with &lt;b&gt;manual heavy-handed equalization&lt;/b&gt;, resulting in
    voices, that sound very sharp or muddy or even like the speaker had a blocked nose.&lt;br&gt;
    Besides the skill and experience of an audio engineer, manual adjustments of frequencies also require a very
    good and &lt;b&gt;linear studio playback device&lt;/b&gt;. For example, performing manual equalization with strongly bass-heavy
    speakers would most likely lead to a very sharp, unpleasant listening experience using treble-heavy headphones.&lt;br&gt;
&lt;/p&gt;
&lt;p&gt;
    For &lt;a href="https://auphonic.com/help/algorithms/singletrack.html" target="_blank"&gt;singletrack&lt;/a&gt; productions
    with &lt;b&gt;more than one speaker&lt;/b&gt;, equalizing is also a very complex and time-consuming process, as &lt;b&gt;every voice&lt;/b&gt;
    has its &lt;b&gt;unique frequency spectrum&lt;/b&gt; and needs its &lt;b&gt;own equalization&lt;/b&gt;. One could separate speakers with cuts or
    create a track envelope to fade from one speaker to another, however, any solution is a very &lt;b&gt;tedious job&lt;/b&gt;,
    if you do it by hand.
&lt;/p&gt;
&lt;p&gt;
    That is where the &lt;b&gt;Auphonic AutoEQ&lt;/b&gt; comes in! All those steps are now available in &lt;b&gt;just one click&lt;/b&gt;!
&lt;/p&gt;


&lt;h3&gt;How does the Auphonic AutoEQ work?&lt;/h3&gt;
&lt;p&gt;
    The &lt;a href="https://auphonic.com/engine/" target="_blank"&gt;Auphonic Web Service&lt;/a&gt; analyzes your audio content and
    &lt;b&gt;classifies the audio file into small and meaningful segments&lt;/b&gt; like music, silence, different speakers, etc. to
    process every single segment with the &lt;b&gt;best matching algorithms&lt;/b&gt;.&lt;br&gt;
    All our features like the &lt;a href="https://auphonic.com/blog/2018/11/05/auphonic-leveler-customization/" target="_blank"&gt;Adaptive Leveler&lt;/a&gt;,
    &lt;a href="https://auphonic.com/blog/2022/06/10/new-noise-reduction-algorithms-beta/" target="_blank"&gt; Dynamic Denoising&lt;/a&gt;,
    &lt;a href="https://auphonic.com/help/algorithms/singletrack.html#adaptive-filtering" target="_blank"&gt;Adaptive &lt;i&gt;'Hi-Pass'&lt;/i&gt; Filtering&lt;/a&gt;,
    and now the new &lt;b&gt;AutoEQ&lt;/b&gt; filter option are built on top of this basic processing.
&lt;/p&gt;
&lt;p&gt;
    Using Auphonic AutoEQ, spectral EQ profiles are created for each speaker separately and permanently changing over time.
    The aim of those &lt;b&gt;time-dependent EQ profiles&lt;/b&gt; is to &lt;b&gt;create a constant, pleasant sound in the output file&lt;/b&gt;
    even if there are &lt;b&gt;slightly changing voices in the record&lt;/b&gt;, for example, due to modified speaker-microphone positions.

&lt;/p&gt;


&lt;h3 id="examples"&gt;Audio Examples&lt;/h3&gt;

&lt;p&gt;
    Here are two short audio examples, which demonstrate some features of our AutoEQ. &lt;br&gt;
    We recommend listening with headphones so you can hear all the details.
&lt;/p&gt;


&lt;h4&gt;Example 1. Female Speaker with Background Music&lt;/h4&gt;
&lt;p&gt;
In the following example (&lt;a href="https://www.bestofbcb.org/wu-207-science-at-treehouse-cafe-dec-7th-leader-of-antarctic-ice-studies/" target="_blank"&gt;BCB: The Voices of Bainbridge Island&lt;/a&gt;) of a female narrator speaking while background music is playing, you can easily recognize quite &lt;b&gt;sharp &lt;i&gt;'sss'&lt;/i&gt; sounds&lt;/b&gt; in the female voice. This sharpness in the female voice is &lt;b&gt;removed&lt;/b&gt; by the so-called &lt;a href="https://en.wikipedia.org/wiki/De-essing" target="_blank"&gt;De-essing&lt;/a&gt; feature of the Auphonic AutoEQ, while the &lt;b&gt;background music&lt;/b&gt; is &lt;b&gt;not changed&lt;/b&gt;.
&lt;/p&gt;
&lt;table style="text-align: center; width: 100%; color: gray; font-size: small;"&gt;
    &lt;tr&gt;
        &lt;td style="padding: 0; vertical-align: middle;"&gt;Original:&lt;/td&gt;
        &lt;td style="width: 90%; padding: 0; vertical-align: middle;"&gt;
          &lt;audio controls style="width: 100%;"&gt;&lt;source src="https://auphonic.com/media/blog/yxq7fvXocdw5fXenUygAuJ_01.woman_cut_NOautoeq_NOdenoiose_cut19s.flac" type="audio/flac"&gt;&lt;/audio&gt;
        &lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
        &lt;td style="padding: 0; vertical-align: middle;"&gt;AutoEQed:&lt;/td&gt;
        &lt;td style="width: 90%; padding: 0; vertical-align: middle;"&gt;
          &lt;audio controls style="width: 100%;"&gt;&lt;source src="https://auphonic.com/media/blog/yxq7fvXocdw5fXenUygAuJ_01.woman_cut_autoeq_NOdenoiose_cut19s_v2.flac" type="audio/flac"&gt;&lt;/audio&gt;
        &lt;/td&gt;
    &lt;/tr&gt;
&lt;/table&gt;


&lt;h4&gt;Example 2. Dialog of Male and Female Speakers&lt;/h4&gt;
&lt;p&gt;
    The next example (&lt;a href="https://www.bestofbcb.org/wu-207-science-at-treehouse-cafe-dec-7th-leader-of-antarctic-ice-studies/" target="_blank"&gt;BCB: The Voices of Bainbridge Island&lt;/a&gt;) shows how the AutoEQ optimizes a &lt;a href="https://auphonic.com/help/algorithms/singletrack.html" target="_blank"&gt;singletrack&lt;/a&gt; record containing two speakers with &lt;b&gt;different voice characteristics&lt;/b&gt;. Our AutoEQ algorithms analyze each voice separately and calculate the matching frequency adjustments to &lt;b&gt;optimize&lt;/b&gt; the voice of &lt;b&gt;every single speaker&lt;/b&gt;.
&lt;/p&gt;
&lt;table style="text-align: center; width: 100%; color: gray; font-size: small;"&gt;
    &lt;tr&gt;
        &lt;td style="padding: 0; vertical-align: middle;"&gt;Original:&lt;/td&gt;
        &lt;td style="width: 90%; padding: 0; vertical-align: middle;"&gt;
          &lt;audio controls style="width: 100%;"&gt;&lt;source src="https://auphonic.com/media/blog/yxq7fvXocdw5fXenUygAuJ_02.man_NOautoeq_NOdenoiose_cut2_32-53_nopause.flac" type="audio/flac"&gt;&lt;/audio&gt;
        &lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
        &lt;td style="padding: 0; vertical-align: middle;"&gt;AutoEQed:&lt;/td&gt;
        &lt;td style="width: 90%; padding: 0; vertical-align: middle;"&gt;
          &lt;audio controls style="width: 100%;"&gt;&lt;source src="https://auphonic.com/media/blog/yxq7fvXocdw5fXenUygAuJ_02.man_autoeq_NOdenoiose_cut2_32-53_nopause_v2.flac" type="audio/flac"&gt;&lt;/audio&gt;
        &lt;/td&gt;
    &lt;/tr&gt;
&lt;/table&gt;


&lt;h3 id="use"&gt;AutoEQ Beta Integration in the Auphonic Web Service&lt;/h3&gt;
&lt;p&gt;
    To use the Auphonic AutoEQ, you just have to create a &lt;a href="https://auphonic.com/engine/" target="_blank"&gt;production&lt;/a&gt;
    or &lt;a href="https://auphonic.com/engine/presets/" target="_blank"&gt;preset&lt;/a&gt; as you are used to, toggle
    &lt;i&gt;&amp;#8220Advanced Parameters&amp;#8221&lt;/i&gt; on the top right in the section &lt;i&gt;&amp;#8220Audio Algorithms&amp;#8221&lt;/i&gt; and select
    &lt;i&gt;&amp;#8220Voice AutoEQ&amp;#8221&lt;/i&gt; within &lt;i&gt;&amp;#8220Filtering&amp;#8221&lt;/i&gt;:
&lt;/p&gt;

&lt;p style="text-align: center; font-size: small;"&gt;
    &lt;img src="https://auphonic.com/media/blog/autoeq-settings.png" /&gt;
&lt;/p&gt;

&lt;p&gt;
    For a first test period, the AutoEQ will only be available &lt;b&gt;for Beta and paying users&lt;/b&gt;, to incorporate your
    feedback and finalize an optimized version.&lt;br&gt;
    If you are a &lt;b&gt;free user&lt;/b&gt; but want to try Auphonic AutoEQ: please just
    &lt;a href="https://auphonic.com/contact" target="_blank"&gt;ask for access&lt;/a&gt;!
&lt;/p&gt;

&lt;h3&gt;Practical Tips&lt;/h3&gt;
&lt;p&gt;
    For &lt;b&gt;best results&lt;/b&gt; using Auphonic AutoEQ, however, it is still necessary, that &lt;b&gt;your audio content is of sufficiently good quality&lt;/b&gt;,
    as no equalizer can make up frequencies that are not there in the first place. Audio files with low bitrates often lack important 
    &lt;b&gt;frequencies&lt;/b&gt;, that &lt;b&gt;cannot be recovered&lt;/b&gt; by equalizing.
    AutoEQ is just a feature to &lt;b&gt;boost or cut individual frequency bands&lt;/b&gt;, not a bandwidth extension. 
    For more information about required audio quality, see the former blog post:
    &lt;a href="https://auphonic.com/blog/2011/07/13/audio-file-formats-podcasts/" target="_blank"&gt;Audio File Formats and Bitrates for Podcasts&lt;/a&gt;.
&lt;/p&gt;
&lt;p&gt;
    Another important topic is the definition of the &lt;i&gt;'best result'&lt;/i&gt;. &lt;b&gt;Equalizing&lt;/b&gt; is a very &lt;b&gt;subjective task&lt;/b&gt;, that differs a lot depending on every personal opinion. So &lt;b&gt;Auphonic AutoEQ&lt;/b&gt; is set up to follow quite conservative rules of equalizing and rather
    apply subtle tweaks and &lt;b&gt;remove obvious problems&lt;/b&gt;, than support personal preferences. 
    This also means, your record will experience &lt;b&gt;no significant changes&lt;/b&gt; from Auphonic AutoEQ if it sounds reasonably OK or &lt;b&gt;pretty good already&lt;/b&gt;.
&lt;/p&gt;


&lt;h3&gt;Conclusion&lt;/h3&gt;
&lt;p&gt;
    Auphonic audio post production algorithms keep getting better and better in leaps and bounds lately, offering you new Beta Features:
    &lt;a href="https://auphonic.com/blog/2022/06/10/new-noise-reduction-algorithms-beta/" target="_blank"&gt;Beta Auphonic Denoiser&lt;/a&gt;,
    &lt;a href="https://auphonic.com/blog/2022/11/08/auphonic-whisper-asr-beta/" target="_blank"&gt;Beta Auphonic Speech Recognition&lt;/a&gt;, and
    Beta Auphonic AutoEQ.&lt;br&gt;
    Right now we are &lt;b&gt;fine-tuning all our current Beta Features&lt;/b&gt; with high intensity to release a new &lt;b&gt;upgraded version&lt;/b&gt;
    of our &lt;a href="https://auphonic.com/engine/" target="_blank"&gt;Auphonic Web Service&lt;/a&gt; as soon as possible.&lt;br&gt;
    Please watch this &lt;a href="https://auphonic.com/blog/" target="_blank"&gt;channel&lt;/a&gt; for further updates &amp;ndash; soon to come.
&lt;/p&gt;
&lt;p&gt;
    If you have any &lt;b&gt;feedback&lt;/b&gt; for us or want to &lt;b&gt;become a Beta user&lt;/b&gt;, you are very welcome to comment directly
    in our production interface or to &lt;a href="https://auphonic.com/contact" target="_blank"&gt;contact us&lt;/a&gt; via email!
&lt;/p&gt;


&lt;br&gt;
&lt;br&gt;
&lt;br&gt;
</description><author>grh@auphonic.com (Georg)</author><pubDate>Tue, 24 Jan 2023 09:35:03 +0000</pubDate><guid>https://auphonic.com/blog/2023/01/24/autoeq-beta/</guid><enclosure length="100000" type="image/jpeg" url="http://auphonic.com/media/blog/fabfilterProQ3_screenshot_small.jpg"/><category>Audio</category><category>Development</category><category>News</category></item><item><title>Auphonic Speech Recognition Engine using Whisper by OpenAI (Beta)
</title><link>https://auphonic.com/blog/2022/11/08/auphonic-whisper-asr-beta/</link><description>
&lt;p&gt;
    Today we release our first self-hosted &lt;b&gt;Auphonic Speech Recognition Engine&lt;/b&gt;
    using the open-source &lt;b&gt;&lt;a href="https://openai.com/blog/whisper/" target="_blank"&gt;Whisper&lt;/a&gt; model
    by &lt;a href="https://openai.com/about/" target="_blank"&gt;OpenAI&lt;/a&gt;&lt;/b&gt;!&lt;br&gt;
    With Whisper, you can now integrate automatic speech recognition &lt;b&gt;in 99 languages&lt;/b&gt; into your
    &lt;a href="https://auphonic.com/" target="_blank"&gt;Auphonic&lt;/a&gt; audio post-production workflow,
    without creating an &lt;a href="https://auphonic.com/help/web/services.html" target="_blank"&gt;external account&lt;/a&gt;
    and &lt;b&gt;without extra costs&lt;/b&gt;!
&lt;/p&gt;

&lt;img src="https://auphonic.com/media/blog/auphonic_whisper_speechrec.png" width="500"&gt;

&lt;h3&gt;Whisper Speech Recognition in Auphonic&lt;/h3&gt;
&lt;p&gt;
    So far, Auphonic users had to choose one of our integrated external service providers
    (&lt;a href="https://auphonic.com/help/algorithms/speech_recognition.html#wit-ai" target="_blank"&gt;Wit.ai&lt;/a&gt;,
    &lt;a href="https://auphonic.com/help/algorithms/speech_recognition.html#google-cloud-speech-api" target="_blank"&gt;Google Cloud Speech&lt;/a&gt;,
    &lt;a href="https://auphonic.com/help/algorithms/speech_recognition.html#amazon-transcribe" target="_blank"&gt;Amazon Transcribe&lt;/a&gt;,
    &lt;a href="https://auphonic.com/help/algorithms/speech_recognition.html#speechmatics" target="_blank"&gt;Speechmatics&lt;/a&gt;)
    for speech recognition, so audio files were transferred to an external server, using &lt;b&gt;external computing powers&lt;/b&gt;,
    that users had to pay for in their external accounts.
&lt;/p&gt;
&lt;p&gt;
    The &lt;b&gt;new Auphonic Speech Recognition is using &lt;a href="https://openai.com/blog/whisper/" target="_blank"&gt;Whisper&lt;/a&gt;&lt;/b&gt;,
    which was published by &lt;a href="https://openai.com/" target="_blank"&gt;OpenAI&lt;/a&gt; as an &lt;b&gt;open-source project&lt;/b&gt;.
    Open-source means, the publicly shared &lt;a href="https://github.com/openai/whisper/" target="_blank"&gt;GitHub repository&lt;/a&gt;
    contains a complete Whisper package including source code, examples, and &lt;a href="https://cdn.openai.com/papers/whisper.pdf"
    target="_blank"&gt;research results&lt;/a&gt;.&lt;br&gt;
    However, automatic speech recognition is a &lt;b&gt;very time and hardware-consuming process&lt;/b&gt;, that can be incredibly
    slow using a standard home computer without special GPUs.
    So we decided to integrate this service and offer you automatic speech recognition (ASR) by Whisper &lt;b&gt;processed on our
    own hardware&lt;/b&gt;, just like any other Auphonic processing task, giving you quite some &lt;b&gt;benefits&lt;/b&gt;:
    &lt;ul&gt;
        &lt;li&gt;&lt;b&gt;No external account is needed&lt;/b&gt; anymore to run ASR in Auphonic.&lt;/li&gt;
        &lt;li&gt;Your &lt;b&gt;data doesn't leave our Auphonic servers&lt;/b&gt; for ASR processing.&lt;/li&gt;
        &lt;li&gt;&lt;b&gt;No extra costs&lt;/b&gt; for external ASR services.&lt;/li&gt;
        &lt;li&gt;Additional &lt;a href="https://auphonic.com/help/algorithms/speech_recognition.html#"
            target="_blank"&gt;Auphonic pre- and post-processing&lt;/a&gt; for &lt;b&gt;more accurate ASR&lt;/b&gt;, especially for
            &lt;a href="https://auphonic.com/help/web/multitrack.html" target="_blank"&gt;Multitrack Productions&lt;/a&gt;.&lt;/li&gt;
        &lt;li&gt;The quality of Whisper ASR is &lt;b&gt;absolutely comparable to the &lt;i&gt;&amp;#8220best&amp;#8221&lt;/i&gt; services&lt;/b&gt; in our &lt;a href="https://auphonic.com/help/algorithms/speech_recognition.html#services-comparison-table"
       target="_blank"&gt;comparison table&lt;/a&gt;.&lt;/li&gt;
    &lt;/ul&gt;
&lt;/p&gt;


&lt;h3 id="use"&gt;How to use Whisper?&lt;/h4&gt;
&lt;p&gt;
    &lt;img src="https://auphonic.com/media/blog/whisper_form.png" width="605" /&gt;

    To use the Auphonic Whisper integration, you just have to create a &lt;a href="https://auphonic.com/engine/" target="_blank"&gt;production&lt;/a&gt;
    or &lt;a href="https://auphonic.com/engine/presets/" target="_blank"&gt;preset&lt;/a&gt; as you are used to and
    select &lt;i&gt;&amp;#8220Auphonic Whisper ASR&amp;#8221&lt;/i&gt; as &lt;i&gt;&amp;#8220Service&amp;#8221&lt;/i&gt; in the section
    &lt;a href="https://auphonic.com/help/web/production.html#speech-recognition" target="_blank"&gt;Speech Recognition&lt;/a&gt;.&lt;br&gt;
    This option will automatically appear &lt;b&gt;for Beta and paying users&lt;/b&gt;. If you are a &lt;b&gt;free user&lt;/b&gt; but want to
    try Whisper: please just &lt;a href="https://auphonic.com/contact" target="_blank"&gt;ask for access&lt;/a&gt;!
&lt;/p&gt;
&lt;p&gt;
    When your Auphonic speech recognition is done, you can download your transcript in different
    &lt;a href="https://auphonic.com/help/algorithms/speech_recognition.html#auphonic-output-formats"
        target="_blank"&gt;formats&lt;/a&gt; and may edit or share your transcript with the
    &lt;a href="https://auphonic.com/help/algorithms/speech_recognition.html#auphonic-transcript-editor"
        target="_blank"&gt;Auphonic Transcript Editor&lt;/a&gt;.&lt;br&gt;
    For more details about all our integrated speech recognition services, please visit our
    &lt;a href="https://auphonic.com/help/algorithms/speech_recognition.html#integrated-speech-services"
    target="_blank"&gt;Speech Recognition Help&lt;/a&gt; and watch this
    &lt;a href="https://auphonic.com/blog/" target="_blank"&gt;channel&lt;/a&gt; for Whisper updates &amp;ndash; soon to come.
&lt;/p&gt;


&lt;h3 id="beta"&gt;Why Beta?&lt;/h4&gt;
&lt;p&gt;
    We decided to launch Whisper for &lt;b&gt;Beta and paying users only&lt;/b&gt;, as Whisper was just published end of
    September and there was &lt;b&gt;not enough time to test every single use case&lt;/b&gt; sufficiently.&lt;br&gt;
    Another issue is the &lt;b&gt;required computing power&lt;/b&gt;:
    for &lt;b&gt;suitable scaling of the GPU infrastructure&lt;/b&gt;, we need a beta phase to test the service
    while we are monitoring the hardware usage, to make sure there are no server overloads.
&lt;/p&gt;


&lt;h3 id="conclusion"&gt;Conclusion&lt;/h3&gt;
&lt;p&gt;
    Automatic speech recognition services are evolving very quickly, and we've seen major improvements over the past few
    years.&lt;br&gt;
    With Whisper, we can now perform &lt;b&gt;speech recognition without extra costs on our own GPU hardware&lt;/b&gt;,
    no &lt;a href="https://auphonic.com/help/web/services.html" target="_blank"&gt;external services&lt;/a&gt; are required anymore. &lt;br&gt;
&lt;/p&gt;
&lt;p&gt;
    Auphonic Whisper ASR is available &lt;b&gt;for Beta and paying users&lt;/b&gt; now, &lt;b&gt;free users&lt;/b&gt; can
    &lt;a href="https://auphonic.com/contact" target="_blank"&gt;ask for Beta access&lt;/a&gt;.
    &lt;br&gt;
    You are very welcome to send us feedback (directly in the production interface or via
    &lt;a href="https://auphonic.com/contact" target="_blank"&gt;email&lt;/a&gt;),
    whether you notice something that works particularly well or discover any problems. &lt;br&gt;
    &lt;b&gt;Your feedback is a great help&lt;/b&gt; to improve the system!
&lt;/p&gt;

&lt;br&gt;
&lt;br&gt;
&lt;br&gt;
</description><author>grh@auphonic.com (Georg)</author><pubDate>Tue, 08 Nov 2022 08:55:26 +0000</pubDate><guid>https://auphonic.com/blog/2022/11/08/auphonic-whisper-asr-beta/</guid><enclosure length="100000" type="image/png" url="http://auphonic.com/media/blog/auphonic_whisper_speechrec.png"/><category>Development</category><category>News</category></item><item><title>New Speechmatics API Integration and Speech Recognition Services Comparison
</title><link>https://auphonic.com/blog/2022/09/08/new-speechmatics-api-integration-and-speech-recognition-services-comparison/</link><description>
&lt;p&gt;
    &lt;a href="https://www.speechmatics.com/" target="_blank"&gt;Speechmatics&lt;/a&gt; released a new API including an
    &lt;b&gt;enhanced transcription engine&lt;/b&gt; (2h free per month!) that we integrated into the &lt;a href="https://auphonic.com/" target="_blank"&gt;Auphonic Web Service&lt;/a&gt; now. &lt;br&gt;
    In this blog post, we also compare the accuracy of all our integrated
    &lt;a href="https://auphonic.com/help/algorithms/speech_recognition.html#integrated-speech-recognition-services" target="_blank"&gt;speech recognition services&lt;/a&gt; and present our
    &lt;a href="#comparision_table"&gt;results&lt;/a&gt;.
&lt;/p&gt;

&lt;img src="https://auphonic.com/media/blog/speechmatics-1200px-logo.png" width="500"&gt; &lt;/img&gt;
&lt;br&gt;

&lt;p&gt;
    Automatic speech recognition is most useful to
    &lt;a href="https://auphonic.com/blog/2016/12/02/make-podcasts-searchable-speech-to-text/" target="_blank"&gt;make audio searchable&lt;/a&gt;:
    Even if automatically generated
    transcripts are not perfect and might be difficult to read (spoken text is very different from written text),
    they are &lt;b&gt;very valuable&lt;/b&gt; if you try to &lt;b&gt;find a specific topic&lt;/b&gt; within a one-hour audio file or if
    you need the exact time of a quote in an audio archive. &lt;br&gt;
    Currently, Auphonic supports the integration of the following four speech recognition services:
    &lt;a href="https://auphonic.com/help/algorithms/speech_recognition.html#wit-ai" target="_blank"&gt;Wit.ai&lt;/a&gt;,
    &lt;a href="https://auphonic.com/help/algorithms/speech_recognition.html#google-cloud-speech-api" target="_blank"&gt;Google Cloud Speech&lt;/a&gt;,
    &lt;a href="https://auphonic.com/help/algorithms/speech_recognition.html#amazon-transcribe" target="_blank"&gt;Amazon Transcribe&lt;/a&gt;, and
    &lt;a href="https://auphonic.com/help/algorithms/speech_recognition.html#speechmatics" target="_blank"&gt;Speechmatics&lt;/a&gt;. &lt;br&gt;
    All speech recognition services are improving very quickly lately, and we'll do our best to keep you updated &amp;ndash;
    &lt;b&gt;getting closer and closer to perfection&lt;/b&gt;.
&lt;/p&gt;
&lt;p&gt;
    Most recently, &lt;a href="https://www.speechmatics.com/" target="_blank"&gt;Speechmatics&lt;/a&gt; developed a new Enhanced Model, that we added to our production services.
    So now, you do have the choice between the &lt;b&gt;Standard Model&lt;/b&gt; with faster results and medium good accuracy or the
    &lt;b&gt;Enhanced Model&lt;/b&gt; with slower results but very good accuracy.
    &lt;br&gt;
    For each transcription model, you can process
    &lt;b&gt;two hours of speech recognition per month for free&lt;/b&gt; (= 4h free per month combined). If you exceed the two hours per month and model, you will be
    charged &lt;b&gt;$1.25/h for Standard&lt;/b&gt; and &lt;b&gt;$1.90/h for Enhanced&lt;/b&gt; Model. For high volumes, you may contact the
    Speechmatics support for a discount.
&lt;/p&gt;


&lt;h3 id="comparision_table"&gt;How do other Speech Recognition Services compare to Speechmatics?&lt;/h3&gt;

&lt;p&gt;
    We tried to compare the &lt;b&gt;relative ASR&lt;/b&gt; (Automatic Speech Recognition) &lt;b&gt;quality of all services&lt;/b&gt; in English and German
    &amp;ndash;  &lt;i&gt;'best'&lt;/i&gt; means just the best one of our integrated services.
    &lt;br&gt;
    As speech recognition services are &lt;b&gt;evolving
    very quickly&lt;/b&gt;, this is just a snapshot and may change again in the near future.
&lt;/p&gt;

&lt;!--    10 15 23 20 16 16--&gt;
&lt;table style="width:100%"&gt;
    &lt;tr&gt;
        &lt;th style="width:10%; border: 1px solid; background: #eee;"&gt; &lt;/th&gt;
        &lt;th style="width:15%; border: 1px solid;"&gt;Wit.ai&lt;/th&gt;
        &lt;th style="width:20%; border: 1px solid; background: #eee;"&gt;Google Speech API&lt;/th&gt;
        &lt;th style="width:22%; border: 1px solid;"&gt;Amazon Transcribe&lt;/th&gt;
        &lt;th colspan="2"style="width:33%; border: 1px solid; background: #eee;"&gt;Speechmatics&lt;/th&gt;

    &lt;/tr&gt;
    &lt;tr&gt;
        &lt;th style="border: 1px solid; background: #eee;"&gt;Price &lt;/th&gt;
        &lt;td style="border: 1px solid;"&gt;&lt;b&gt;free&lt;/b&gt;, &lt;br&gt;
            also for commercial use&lt;/td&gt;
        &lt;td style="border: 1px solid; background: #eee;"&gt;1+1h free per month, &lt;br&gt;
            (Enhanced + Default Model), &lt;br&gt;
            then &lt;b&gt;~$0.96-$2.16/h&lt;/b&gt; &lt;br&gt;
            (depending on user settings)&lt;/td&gt;
        &lt;td style="border: 1px solid;"&gt;1h free per month, &lt;br&gt;
            (first 12 months), &lt;br&gt;
            then &lt;b&gt;~$1.44/h&lt;/b&gt;&lt;/td&gt;
        &lt;td style="border: 1px solid; background: #eee;"&gt;&lt;b&gt;Standard&lt;/b&gt; &lt;br&gt;
            2h free per month, &lt;br&gt;
            then &lt;b&gt;~$1.25/h&lt;/b&gt;, &lt;br&gt;
            much cheaper for high volumes&lt;/td&gt;
        &lt;td style="border: 1px solid;"&gt;&lt;b&gt;Enhanced&lt;/b&gt; &lt;br&gt;
            2h free per month, &lt;br&gt;
            then &lt;b&gt;~$1.90/h&lt;/b&gt; &lt;br&gt;
            much cheaper for high volumes&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
        &lt;th style="border: 1px solid; background: #eee;" style="border: 1px solid;"&gt;ASR Quality English&lt;/th&gt;
        &lt;td style="border: 1px solid;"&gt;basic&lt;/td&gt;
        &lt;td style="border: 1px solid; background: #eee;"&gt;good (Enhanced Model)&lt;/td&gt;
        &lt;td style="border: 1px solid;"&gt;very good&lt;/td&gt;
        &lt;td style="border: 1px solid; background: #eee;"&gt;very good&lt;/td&gt;
        &lt;td style="border: 1px solid;"&gt;best&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
        &lt;th style="border: 1px solid; background: #eee;"&gt;ASR Quality German&lt;/th&gt;
        &lt;td style="border: 1px solid;"&gt;basic&lt;/td&gt;
        &lt;td style="border: 1px solid; background: #eee;"&gt;basic (Default Model)&lt;/td&gt;
        &lt;td style="border: 1px solid;"&gt;very good&lt;/td&gt;
        &lt;td style="border: 1px solid; background: #eee;"&gt;very good&lt;/td&gt;
        &lt;td style="border: 1px solid;"&gt;best&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
        &lt;th style="border: 1px solid; background: #eee;"&gt;Keyword Support&lt;/th&gt;
        &lt;td style="border: 1px solid;"&gt;No&lt;/td&gt;
        &lt;td style="border: 1px solid; background: #eee;"&gt;Yes&lt;/td&gt;
        &lt;td style="border: 1px solid;"&gt;Yes&lt;/td&gt;
        &lt;td style="border: 1px solid; background: #eee;"&gt;Yes&lt;/td&gt;
        &lt;td style="border: 1px solid;"&gt;Yes&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
        &lt;th style="border: 1px solid; background: #eee;"&gt;Word Timestamps and Confidence&lt;/th&gt;
        &lt;td style="border: 1px solid;"&gt;No&lt;/td&gt;
        &lt;td style="border: 1px solid; background: #eee;"&gt;No&lt;/td&gt;
        &lt;td style="border: 1px solid;"&gt;Yes&lt;/td&gt;
        &lt;td style="border: 1px solid; background: #eee;"&gt;Yes&lt;/td&gt;
        &lt;td style="border: 1px solid;"&gt;Yes&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
        &lt;th style="border: 1px solid; background: #eee;"&gt;Speed&lt;/th&gt;
        &lt;td style="border: 1px solid;"&gt;fast&lt;/td&gt;
        &lt;td style="border: 1px solid; background: #eee;"&gt;fast&lt;/td&gt;
        &lt;td style="border: 1px solid;"&gt;much slower&lt;/td&gt;
        &lt;td style="border: 1px solid; background: #eee;"&gt;medium&lt;/td&gt;
        &lt;td style="border: 1px solid;"&gt;slower&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
        &lt;th style="border: 1px solid; background: #eee;"&gt;Supported Languages&lt;/th&gt;
        &lt;td style="border: 1px solid;"&gt;ar, bn, my, zh, nl, en, fi, fr, de, hi, id, it, ja, ca, ko, ms, ml, mr, pl, pt, ru, si, es, sv, tl, ta, th, tr, ur, vi&lt;/td&gt;
        &lt;td style="border: 1px solid; background: #eee;"&gt;&lt;b&gt;most languages supported!&lt;/b&gt; &lt;br&gt; 138 languages and dialects (see: &lt;a href="https://cloud.google.com/speech-to-text/docs/languages" target="_blank"&gt;Google Language Support&lt;/a&gt;)&lt;/td&gt;
        &lt;td style="border: 1px solid;"&gt;af-ZA, ar-AE, ar-SA, zh-CN, zh-TW, da-DK, nl-NL, en-AU, en-GB, en-IN, en-IE, en-NZ, en-AB, en-ZA, en-US, en-WL, fr-FR, fr-CA, fa-IR, de-DE, de-CH, he-IL, hi-IN, id-ID, it-IT, ja-JP, ko-KR, ms-MY, pt-PT, pt-BR, ru-RU, es-ES, es-US, ta-IN, te-IN, th-TH, tr-TR&lt;/td&gt;
        &lt;td style="border: 1px solid; background: #eee;"&gt;ar, bg, yue, ca, hr, cs, da, nl, en, fi, fr, de, el, hi, hu, it, id, ja, ko, lv, lt, ms, cmn, no, pl, pt, ro, ru, sk, sl, es, sv, tr, uk&lt;/td&gt;
        &lt;td style="border: 1px solid;"&gt;ar, bg, yue, ca, hr, cs, da, nl, en, fi, fr, de, el, hi, hu, it, id, ja, ko, lv, lt, ms, cmn, no, pl, pt, ro, ru, sk, sl, es, sv, tr, uk&lt;/td&gt;
    &lt;/tr&gt;
&lt;/table&gt;


&lt;h3&gt;Try out Speechmatics in Auphonic&lt;/h3&gt;

&lt;b&gt;1. Connect Speechmatics to your Auphonic Account&lt;/b&gt;

&lt;p&gt;
    Sign in to your Auphonic account, go to &lt;a href="https://auphonic.com/engine/services/" target="_blank"&gt;Services&lt;/a&gt;,
    and add Speechmatics as an
    &lt;a href="https://auphonic.com/engine/speechmatics/create/" target="_blank"&gt;External Service&lt;/a&gt;:
    &lt;img src="https://auphonic.com/media/blog/Speechmatics_new.png" width="630" /&gt;

    &lt;ul&gt;
    &lt;li&gt;Enter a display name for the Speechmatics service in your Auphonic account.&lt;/li&gt;
    &lt;li&gt;Sign up for a &lt;a href="https://page.speechmatics.com/portal-signup.html" target="_blank"&gt;Speechmatics Account&lt;/a&gt;.
        &lt;/li&gt;
    &lt;li&gt;On the Speechmatics page, go to &lt;i&gt;&amp;#8220Manage Access&amp;#8221&lt;/i&gt; on the left, choose a name for your key and click &lt;i&gt;&amp;#8220Generate API Key&amp;#8221&lt;/i&gt;.
        This API Key will only be shown once, so make sure you &lt;b&gt;keep it safe&lt;/b&gt;!
        Copy your generated API Key to your Auphonic account into the form field &lt;i&gt;&amp;#8220API Key&amp;#8221&lt;/i&gt;.&lt;/li&gt;
    &lt;li&gt;For &lt;i&gt;&amp;#8220Model Accuracy&amp;#8221&lt;/i&gt; please select &lt;i&gt;&amp;#8220Standard Model&amp;#8221&lt;/i&gt; or &lt;i&gt;&amp;#8220Enhanced Model&amp;#8221&lt;/i&gt;.&lt;/li&gt;
&lt;/ul&gt;
&lt;/p&gt;
&lt;p&gt;
    If you want to use both &lt;b&gt;Standard and Enhanced Models&lt;/b&gt; of Speechmatics once in a while, you need to create
    &lt;b&gt;two separate services&lt;/b&gt; (one service for each model) in your Auphonic account!
&lt;/p&gt;

&lt;b&gt;2. Add Speechmatics to your Auphonic Production&lt;/b&gt;

&lt;p&gt;
    Once your Speechmatics and Auphonic Accounts are connected, you can either create a
    &lt;a href="https://auphonic.com/help/web/preset.html#web-preset" target="_blank"&gt;preset&lt;/a&gt; or directly start your
    &lt;a href="https://auphonic.com/help/web/production.html#web-production" target="_blank"&gt;production&lt;/a&gt; just like you are used to. &lt;br&gt;
    In section &lt;b&gt;&amp;#8220Speech Recognition&amp;#8221&lt;/b&gt; you may set &lt;i&gt;&amp;#8220Service&amp;#8221&lt;/i&gt; to &lt;i&gt;&amp;#8220Speechmatics&amp;#8221&lt;/i&gt;,
    select the language of your audio, add &lt;i&gt;&amp;#8220Keywords&amp;#8221&lt;/i&gt; if you want, and you are &lt;b&gt;ready to &amp;#8220Start Production&amp;#8221!&lt;/b&gt;

    &lt;img src="https://auphonic.com/help/_images/SpeechRecognition.png" width="630" /&gt;

    In your Speechmatics account menu &amp;#8220&lt;a href="https://portal.speechmatics.com/usage/" target="_blank"&gt;Track Usage&lt;/a&gt;&amp;#8221
    there is a detailed list of your usage for the current month. For more information, you can also watch the following
    &lt;a href="https://www.youtube.com/watch?v=FdwuuHeUGFc" target="_blank"&gt;Video Tutorial by Speechmatics&lt;/a&gt; about usage, limits, and billing.
&lt;/p&gt;

&lt;b&gt;3. Correct Results using the Auphonic Transcript Editor&lt;/b&gt;
&lt;p&gt;
    Auphonic also includes a
    &lt;a href="https://auphonic.com/help/algorithms/speech_recognition.html#auphonic-transcript-editor" target="_blank"&gt;Transcript Editor&lt;/a&gt;
    directly in our HTML output file. &lt;br&gt;
    If you use &lt;a href="https://www.speechmatics.com/" target="_blank"&gt;Speechmatics&lt;/a&gt;
    or &lt;a href="https://aws.amazon.com/transcribe/" target="_blank"&gt;Amazon Transcribe&lt;/a&gt;,
    the editor displays word confidence values to &lt;b&gt;instantly see which sections
    should be checked manually&lt;/b&gt;:
&lt;/p&gt;

&lt;img src="https://auphonic.com/help/_images/TranscriptEditor.png" width="630" /&gt;


&lt;h3&gt;Conclusion&lt;/h3&gt;
&lt;p&gt;
    Automatic Speech Recognition Services are evolving very quickly, and we've seen &lt;b&gt;great improvements&lt;/b&gt; since our
    last comparisons in 2018 &amp;ndash; especially in &lt;b&gt;recognizing sloppy language, accents, and dialects&lt;/b&gt;.
&lt;/p&gt;
&lt;p&gt;
    With the new Enhanced Transcription Model by Speechmatics, we can now pass on further optimizations to you at a very reasonable price (&lt;b&gt;4h free per month&lt;/b&gt;) &amp;ndash;
    and we guess there are more improvements to come pretty soon.
&lt;/p&gt;
&lt;p&gt;
    Also, &lt;b&gt;please &lt;a href="https://auphonic.com/contact" target="_blank"&gt;let us know&lt;/a&gt; if you get different results&lt;/b&gt; comparing ASR services or if you compare services in other languages!
&lt;/p&gt;

&lt;br&gt;&lt;br&gt;&lt;br&gt;
</description><author>freymat@auphonic.com (Matthias)</author><pubDate>Thu, 08 Sep 2022 07:46:34 +0000</pubDate><guid>https://auphonic.com/blog/2022/09/08/new-speechmatics-api-integration-and-speech-recognition-services-comparison/</guid><enclosure length="100000" type="image/png" url="http://auphonic.com/media/blog/speechmatics-1200px-logo.png"/><category>Development</category><category>News</category></item><item><title>New Advanced Leveler with Broadcast Parameters (MaxLRA, MaxS, MaxM)
</title><link>https://auphonic.com/blog/2020/12/04/advanced-leveler-broadcast-parameters/</link><description>
&lt;p&gt;
    Today we are thrilled to introduce revised parameters for the
    &lt;a href="https://auphonic.com/help/algorithms/singletrack.html#adaptive-leveler" target="_blank"&gt;Adaptive Leveler&lt;/a&gt;
    to move our
    &lt;a href="https://auphonic.com/help/web/production.html#advanced-audio-algorithms" target="_blank"&gt;advanced algorithms&lt;/a&gt; out of beta.
    &lt;br&gt;
    The leveler can now run in three modes, which allow detailed
    &lt;b&gt;Leveler Strength&lt;/b&gt; control and also the use of &lt;b&gt;Broadcast Parameters&lt;/b&gt;
    (Max. Loudness Range, Max. Short-term Loudness, Max. Momentary Loudness)
    to limit the amount of leveling.
&lt;/p&gt;

&lt;p style="text-align: center; font-size: small;"&gt;
&lt;a href="https://unsplash.com/photos/v5QCF2PFHZ4" target="_blank"&gt;
  &lt;img src="https://auphonic.com/media/blog/broadcast-leveler-img.jpg" width="500" /&gt;
&lt;/a&gt;
Photo by &lt;a href="https://unsplash.com/photos/v5QCF2PFHZ4" target="_blank"&gt;Gemma Evans&lt;/a&gt;.
&lt;/p&gt;

&lt;p&gt;
    When we first
    &lt;a href="https://auphonic.com/blog/2018/08/21/leveler-presets-lra-target-advanced-audio-params-beta/" target="_blank"&gt;introduced our advanced parameters&lt;/a&gt;,
    we used the
    &lt;a href="#maxlra"&gt;Maximum Loudness Range (MaxLRA)&lt;/a&gt; value to control the strength of our leveler.
    This gave good results, but it turned out that
    &lt;a href="https://auphonic.com/blog/2019/08/19/dynamic-range-processing/#lra-problem" target="_blank"&gt;only pure speech programs give reliable and comparable LRA values&lt;/a&gt;
    and it was sometimes difficult to set a loudness range target for diverse audio content.
    To resolve this issue, we reworked the parameter and called it
    &lt;a href="https://auphonic.com/blog/2018/11/05/auphonic-leveler-customization/" target="_blank"&gt;Dynamic Range&lt;/a&gt;. &lt;br&gt;
    After discussions with our users, however, we received the feedback that the name
    &lt;em&gt;Dynamic Range&lt;/em&gt; was &lt;b&gt;too confusing&lt;/b&gt;, so we decided to call
    it &lt;a href="#leveler-strength"&gt;Leveler Strength&lt;/a&gt;.
&lt;/p&gt;
&lt;p&gt;
    In our discussions with users, we were also told they like to be able
    to set a loudness range target to limit the amount of leveling,
    because &lt;b&gt;MaxLRA is often used by broadcasters and in regulations&lt;/b&gt;. &lt;br&gt;
    As a solution, we added a &lt;a href="#broadcast-mode"&gt;Broadcast Mode&lt;/a&gt;,
    which makes it now possible to use the MaxLRA, MaxS, and MaxM values to control
    the strength of our leveler as well.
&lt;/p&gt;

&lt;p&gt;
    In this blog post, we will first discuss the new parameters - 
    &lt;em&gt;Leveler Strength&lt;/em&gt;, &lt;em&gt;Music Gain&lt;/em&gt;, &lt;em&gt;MusicSpeech Classifier Settings&lt;/em&gt;,
    &lt;em&gt;MaxLRA&lt;/em&gt;, &lt;em&gt;MaxS&lt;/em&gt;, and &lt;em&gt;MaxM&lt;/em&gt; - 
    of our &lt;a href="#singletrack"&gt;Singletrack Advanced Leveler&lt;/a&gt;,
    then we will show how these settings can be used in the
    &lt;a href="#multitrack"&gt;Multitrack Advanced Leveler&lt;/a&gt;.
&lt;/p&gt;


&lt;h3 id="singletrack"&gt;Singletrack Advanced Leveler&lt;/h3&gt;

&lt;p&gt;
    The &lt;a href="https://auphonic.com/help/algorithms/singletrack.html#adaptive-leveler" target="_blank"&gt;Adaptive Leveler&lt;/a&gt;
    normalizes all speakers to a similar loudness
    so that a consumer in a car or subway doesn't feel the need to reach for the volume control.
    However, in other environments (living room, cinema, etc.) or in dynamic recordings, you might want &lt;b&gt;more level differences&lt;/b&gt;
    (&lt;a href="https://auphonic.com/blog/2019/08/19/dynamic-range-processing/" target="_blank"&gt;Dynamic Range, Loudness Range / LRA&lt;/a&gt;)
    between speakers and within music segments.
&lt;/p&gt;

&lt;p&gt;
    Our new parameters let users control the &lt;em&gt;Leveler Strength&lt;/em&gt;
    to adjust &lt;a href="https://auphonic.com/blog/2011/07/25/loudness-normalization-and-compression-podcasts-and-speech-audio/#mid-term" target="_blank"&gt; mid-term level differences&lt;/a&gt;,
    similar to a sound engineer using the faders of an audio mixer,
    and &lt;em&gt;Compressor Settings&lt;/em&gt;
    for &lt;a href="https://auphonic.com/blog/2011/07/25/loudness-normalization-and-compression-podcasts-and-speech-audio/#compressor" target="_blank"&gt;
    short-term dynamics control&lt;/a&gt;. &lt;br&gt;
    The Advanced Leveler can be used in three different &lt;em&gt;Modes&lt;/em&gt;: &lt;br&gt;
    &lt;a href="#default-mode"&gt;Default Mode&lt;/a&gt;, &lt;a href="#separate-ms-params"&gt;Separate MusicSpeech Parameters&lt;/a&gt;, and &lt;a href="#broadcast-mode"&gt;Broadcast Mode&lt;/a&gt;.
&lt;/p&gt;
&lt;p&gt;
    For more details, please see our
    &lt;a href="https://auphonic.com/help/web/production.html#leveler-parameters" target="_blank"&gt;Leveler Parameters Help&lt;/a&gt;.
&lt;/p&gt;


&lt;h3 id="default-mode"&gt;Default Mode&lt;/h3&gt;

&lt;p style="text-align: center; font-size: small;"&gt;
&lt;img src="https://auphonic.com/media/blog/adv-leveler-default-mode.png" width="610" /&gt;
&lt;/p&gt;

&lt;dl&gt;
    &lt;dt id="leveler-strength"&gt;&lt;b&gt;Leveler Strength:&lt;/b&gt;&lt;/dt&gt;
    &lt;dd&gt;
        The &lt;em&gt;Leveler Strength&lt;/em&gt; controls &lt;b&gt;how much leveling&lt;/b&gt; is applied:
        100% means full leveling, 0% means no leveling at all.
        Changing the &lt;em&gt;Leveler Strength&lt;/em&gt; increases/decreases the &lt;em&gt;Dynamic Range&lt;/em&gt; of the output file.
        &lt;br&gt;&lt;br&gt;
        &lt;em&gt;Example Use Case:&lt;/em&gt; &lt;br&gt;
        Lower &lt;em&gt;Leveler Strength&lt;/em&gt; values should be used if you want to keep more loudness differences in dynamic narration
        or dynamic music recordings (live concert/classical).
    &lt;/dd&gt;
    &lt;br&gt;
    &lt;dt id="compressor"&gt;&lt;b&gt;Compressor Settings:&lt;/b&gt;&lt;/dt&gt;
    &lt;dd&gt;
        Here you can select a preset value for micro-dynamics compression. &lt;br&gt;
        A compressor reduces the volume of short and loud spikes like the pronunciation of "p" and "t"
        or laughter
        (&lt;a href="https://auphonic.com/blog/2011/07/25/loudness-normalization-and-compression-podcasts-and-speech-audio/#compressor" target="_blank"&gt;short-term dynamics&lt;/a&gt;) and also &lt;b&gt;shapes the sound&lt;/b&gt; of your voice (making the sound more or less "processed" or "punchy").
    &lt;/dd&gt;
&lt;/dl&gt;


&lt;h3 id="separate-ms-params"&gt;Separate MusicSpeech Parameters&lt;/h3&gt;

&lt;p style="text-align: center; font-size: small;"&gt;
&lt;img src="https://auphonic.com/media/blog/adv-leveler-ms-mode.png" width="610" /&gt;
&lt;/p&gt;

&lt;p&gt;
    In the &lt;em&gt;Separate MusicSpeech Parameters&lt;/em&gt; mode, &lt;b&gt;independent settings for music and speech segments&lt;/b&gt;
    (&lt;em&gt;Music Leveler Strength&lt;/em&gt;, &lt;em&gt;Music Compressor&lt;/em&gt;)
    can be selected. &lt;br&gt;
    These settings allow you to use, for example, more leveling in speech segmets while keeping music and FX elements less processed.
&lt;/p&gt;
&lt;p&gt;
    You can also disable our music/speech classifier or add a gain
    to music segments:
&lt;/p&gt;

&lt;dl&gt;
    &lt;dt id="ms-classifier"&gt;&lt;b&gt;MusicSpeech Classifier:&lt;/b&gt;&lt;/dt&gt;
    &lt;dd&gt;
        Use our speech/music classifier to level music and speech segments separately, or override the classifier decision and treat the whole audio file as speech or music.
    &lt;/dd&gt;
    &lt;br&gt;
    &lt;dt id="music-gain"&gt;&lt;b&gt;Music Gain:&lt;/b&gt;&lt;/dt&gt;
    &lt;dd&gt;
        Add a gain to music segments, to make music louder or softer compared to the speech parts.
        Use the default setting (0 dB) to give music and speech parts a similar average loudness.
    &lt;/dd&gt;
&lt;/dl&gt;


&lt;h3 id="broadcast-mode"&gt;Broadcast Mode&lt;/h3&gt;

&lt;p style="text-align: center; font-size: small;"&gt;
&lt;img src="https://auphonic.com/media/blog/adv-leveler-broadcast-mode.png" width="610" /&gt;
&lt;/p&gt;

&lt;p&gt;
    The &lt;em&gt;Broadcast Mode&lt;/em&gt; uses different parameters,
    which are often used by broadcasters and in regulations,
    to &lt;b&gt;control the &lt;em&gt;Leveler Strength&lt;/em&gt;&lt;/b&gt;:
&lt;/p&gt;

&lt;dl&gt;
    &lt;dt id="maxlra"&gt;&lt;b&gt;Maximum Loudness Range (LRA):&lt;/b&gt;&lt;/dt&gt;
    &lt;dd&gt;
        The loudness range (LRA) indicates the variation of loudness throughout
        a program and is measured in &lt;b&gt;LU (loudness units)&lt;/b&gt; - for more details see
        &lt;a href="https://auphonic.com/blog/2012/08/02/loudness-measurement-and-normalization-ebu-r128-calm-act/#lra" target="_blank"&gt;Loudness Measurement and Normalization&lt;/a&gt; or
        &lt;a href="http://tech.ebu.ch/webdav/site/tech/shared/tech/tech3342.pdf" target="_blank"&gt;EBU Tech 3342&lt;/a&gt;.
        &lt;br&gt;
        The volume changes of our Leveler
        will be restricted so that the &lt;b&gt;LRA of the output file is below the selected value&lt;/b&gt; (if possible).
        &lt;br&gt;
        High LRA values will result in very dynamic output files, whereas low LRA values will result in compressed output audio.
        If the LRA value of your input file is already below the maximum loudness range value, no leveling at all will be applied.
        &lt;br&gt;&lt;br&gt;
        &lt;em&gt;Loudness Range&lt;/em&gt; values are &lt;b&gt;most reliable for pure speech programs&lt;/b&gt;:
        a typical LRA value for news programs is 3 LU; for talks and discussions, an LRA value of 5 LU is common.
        LRA values for features, radio dramas, movies, or music strongly depend on the individual character and might be in the range of 5 to 25 LU - for more information, please see
        &lt;a href="https://auphonic.com/blog/2019/08/19/dynamic-range-processing/#lra-problem" target="_blank"&gt;Where LRA falls short&lt;/a&gt;. &lt;br&gt;
        &lt;b&gt;Netflix&lt;/b&gt;, for instance,
        &lt;a href="https://partnerhelp.netflixstudios.com/hc/en-us/articles/360001794307-Netflix-Audio-Mix-Specifications-Best-Practices-v1-0#3_1" target="_blank"&gt;recommends&lt;/a&gt;
        an LRA of 4 to 18 LU for the overall program and 7 LU or less for dialog.
    &lt;/dd&gt;
    &lt;br&gt;
    &lt;dt id="maxs"&gt;&lt;b&gt;Maximum Short-term Loudness (MaxS):&lt;/b&gt;&lt;/dt&gt;
    &lt;dd&gt;
        Set a &lt;em&gt;Maximum Short-term Loudness&lt;/em&gt; target
        (3s measurement window, see &lt;a href="https://tech.ebu.ch/docs/tech/tech3341.pdf" target="_blank"&gt;EBU Tech 3341, Section 2.2&lt;/a&gt;)
        relative to your &lt;a href="https://auphonic.com/help/web/production.html#audio-algorithms" target="_blank"&gt;Global Loudness Normalization Target&lt;/a&gt;. &lt;br&gt;
        Our Adaptive Leveler will ensure that the MaxS loudness value of the output file,
        which are &lt;b&gt;loudness values measured with an integration time of 3s&lt;/b&gt;,
        will be &lt;b&gt;below this target&lt;/b&gt; (if possible). &lt;br&gt;
        For example, if the MaxS value is set to &lt;em&gt;+5 LU relative&lt;/em&gt; and the &lt;em&gt;Loudness Target&lt;/em&gt; to &lt;em&gt;-23 LUFS&lt;/em&gt;,
        then the absolute MaxS value of your output file will be restricted to &lt;em&gt;-18 LUFS&lt;/em&gt;.
        &lt;br&gt;&lt;br&gt;
        The &lt;em&gt;Max Short-term Loudness&lt;/em&gt; is used in certain regulations for short-form content and advertisements. &lt;br&gt;
        See for example &lt;a href="https://tech.ebu.ch/docs/r/r128s1.pdf" target="_blank"&gt;EBU R128 S1: Loudness Parameters for Short-form Content (advertisements, promos, etc.)&lt;/a&gt;,
        which recommends a &lt;em&gt;Max Short-term Loudness&lt;/em&gt; of &lt;em&gt;+5 LU relative&lt;/em&gt;.
    &lt;/dd&gt;
    &lt;br&gt;
    &lt;dt id="maxm"&gt;&lt;b&gt;Maximum Momentary Loudness (MaxM):&lt;/b&gt;&lt;/dt&gt;
    &lt;dd&gt;
        Similar to the &lt;em&gt;MaxS&lt;/em&gt; target, it's also possible to use a
        &lt;em&gt;Maximum Momentary Loudness&lt;/em&gt; target
        (0.4s measurement window, see &lt;a href="https://tech.ebu.ch/docs/tech/tech3341.pdf" target="_blank"&gt;EBU Tech 3341, Section 2.2&lt;/a&gt;)
        relative to your &lt;a href="https://auphonic.com/help/web/production.html#audio-algorithms" target="_blank"&gt;Global Loudness Normalization Target&lt;/a&gt;. &lt;br&gt;
        Our Adaptive Leveler will ensure that the MaxM loudness value of the output file,
        which are &lt;b&gt;loudness values measured with an integration time of 0.4s&lt;/b&gt;,
        will be &lt;b&gt;below this target&lt;/b&gt; (if possible).
        &lt;br&gt;&lt;br&gt;
        The &lt;em&gt;Max Momentary Loudness&lt;/em&gt; is used in certain regulations by broadcasters.
        For example, &lt;a href="https://www.pro-tools-expert.com/home-page/2018/8/23/has-netflix-turned-the-clock-back-10-years-or-is-their-new-loudness-delivery-spec-a-stroke-of-genius" target="_blank"&gt;CBC and Radio Canada require&lt;/a&gt; that
        the &lt;em&gt;Momentary Loudness&lt;/em&gt; must not exceed &lt;em&gt;+10 LU&lt;/em&gt; above the target loudness.
    &lt;/dd&gt;
&lt;/dl&gt;

&lt;p&gt;
    If it's not possible for the levels of the output file to be below the given
    &lt;em&gt;MaxLRA&lt;/em&gt;, &lt;em&gt;MaxS&lt;/em&gt;, or &lt;em&gt;MaxM&lt;/em&gt; target values,
    you will receive a &lt;b&gt;warning message&lt;/b&gt; via email and on the production page.
&lt;/p&gt;

&lt;p&gt;
    &lt;em&gt;Example Use Case:&lt;/em&gt; &lt;br&gt;
    The broadcast parameters can be used to generate automatic mixdowns with different LRA values
    for different target environments (very compressed environments like mobile devices or Alexa, or very dynamic ones like home cinema, etc.).
&lt;/p&gt;


&lt;h3 id="multitrack"&gt;Multitrack Advanced Leveler&lt;/h3&gt;

&lt;p&gt;
The new leveling parameters are also available in our
&lt;a href="https://auphonic.com/help/algorithms/multitrack.html" target="_blank"&gt;multitrack version&lt;/a&gt;. &lt;br&gt;
Here you can set separate &lt;a href="#leveler-track-params"&gt;leveling parameters per track&lt;/a&gt;,
and also use &lt;a href="#broadcast-mode"&gt;broadcast parameters&lt;/a&gt; in the
&lt;a href="#leveler-master-params"&gt;final mixdown&lt;/a&gt;, to ensure that your levels are below the given
&lt;em&gt;MaxLRA&lt;/em&gt;, &lt;em&gt;MaxS&lt;/em&gt;, or &lt;em&gt;MaxM&lt;/em&gt; target values.
&lt;/p&gt;

&lt;h3 id="leveler-track-params"&gt;Leveling Parameters per Track&lt;/h3&gt;

&lt;p style="text-align: center; font-size: small;"&gt;
&lt;img src="https://auphonic.com/media/blog/mt-adv-track-leveler.png" width="620" /&gt;
&lt;/p&gt;

&lt;p&gt;
    The parameters
    &lt;a href="#leveler-strength"&gt;Leveler Strength&lt;/a&gt;,
    &lt;a href="#compressor"&gt;Compressor&lt;/a&gt;,
    &lt;em&gt;MusicSpeech Classifier&lt;/em&gt;, &lt;em&gt;Stereo Panorama&lt;/em&gt;, and &lt;em&gt;Track Gain&lt;/em&gt;
    allow you to customize which parts of the track audio should be leveled,
    how much they should be leveled, and how much dynamic range compression should be applied.
&lt;/p&gt;
&lt;dl&gt;
    &lt;dt&gt;&lt;b&gt;MusicSpeech Classifier Setting:&lt;/b&gt;&lt;/dt&gt;
    &lt;dd&gt;
        Select between the &lt;i&gt;Speech Track&lt;/i&gt; and &lt;i&gt;Music Track&lt;/i&gt; Adaptive Leveler. &lt;br&gt;
        If this is set to &lt;i&gt;On&lt;/i&gt;, a classifier will decide if this is a music or speech track.
    &lt;/dd&gt;
    &lt;br&gt;
    &lt;dt&gt;&lt;b&gt;Stereo Panorama (Balance):&lt;/b&gt;&lt;/dt&gt;
    &lt;dd&gt;
        Change the stereo panorama (balance for stereo input files) of the current track.
    &lt;/dd&gt;
    &lt;br&gt;
    &lt;dt&gt;&lt;b&gt;Track Gain:&lt;/b&gt; (in the &lt;em&gt;Fore/Background&lt;/em&gt; section)&lt;/dt&gt;
    &lt;dd&gt;
        Increase/decrease the loudness of this track compared to other tracks. &lt;br&gt;
        This can be used to add gain to a music or a specific speech track,
        making it louder/softer compared to other tracks.
    &lt;/dd&gt;
&lt;/dl&gt;

&lt;p&gt;
    For more details, please see
    &lt;a href="https://auphonic.com/help/web/multitrack.html#leveler-parameters" target="_blank"&gt;Multitrack Leveler Parameters Help&lt;/a&gt;.
&lt;/p&gt;


&lt;h3 id="leveler-master-params"&gt;Leveling Master Parameters&lt;/h3&gt;

&lt;p style="text-align: center; font-size: small;"&gt;
&lt;img src="https://auphonic.com/media/blog/mt-adv-master-leveler.png" width="604" /&gt;
&lt;/p&gt;

&lt;p&gt;
    In addition to our
    &lt;a href="#leveler-track-params"&gt;track parameters&lt;/a&gt;, you can
    switch the &lt;b&gt;Leveler Mode&lt;/b&gt; in the master algorithm settings to &lt;b&gt;Broadcast Mode&lt;/b&gt;
    to control the combined leveling strength.
    &lt;br&gt;
    Volume changes of our leveling algorithms
    will be adjusted so that the &lt;b&gt;final mixdown&lt;/b&gt; of the multitrack production
    &lt;b&gt;meets&lt;/b&gt; the given &lt;em&gt;MaxLRA&lt;/em&gt;, &lt;em&gt;MaxS&lt;/em&gt;, or &lt;em&gt;MaxM&lt;/em&gt; &lt;b&gt;target values&lt;/b&gt; - as is done in the &lt;a href="#broadcast-mode"&gt;Singletrack Broadcast Mode&lt;/a&gt;.
&lt;/p&gt;

&lt;p&gt;
    For more details, please see
    &lt;a href="https://auphonic.com/help/web/multitrack.html#master-algorithm-parameters" target="_blank"&gt;Master Algorithm Parameters Help&lt;/a&gt;.
&lt;/p&gt;


&lt;h3&gt;Summary&lt;/h3&gt;

&lt;p&gt;
    We revised the leveling parameters to &lt;b&gt;end the beta phase&lt;/b&gt; of our
    &lt;a href="https://auphonic.com/help/web/production.html#advanced-audio-algorithms" target="_blank"&gt;advanced algorithms&lt;/a&gt;. &lt;br&gt;
    All advanced settings are stable and will not change significantly going forward.
&lt;/p&gt;

The following leveling parameters are available:
&lt;ul&gt;
    &lt;li&gt;
        &lt;b&gt;Leveler Strength:&lt;/b&gt; control the strength of the leveling algorithm
    &lt;/li&gt;
    &lt;li&gt;
        &lt;b&gt;Compressor:&lt;/b&gt; select a preset for short-term dynamics control
    &lt;/li&gt;
    &lt;li&gt;
        &lt;b&gt;Music Gain / Track Gain:&lt;/b&gt; make music parts louder/softer compared to speech parts
    &lt;/li&gt;
    &lt;li&gt;
        &lt;b&gt;MusicSpeech Classifier:&lt;/b&gt; use our classifier or set everything to speech/music
    &lt;/li&gt;
    &lt;li&gt;
        &lt;b&gt;Separate MusicSpeech Parameters Mode:&lt;/b&gt; separate controls for speech and music parts
    &lt;/li&gt;
    &lt;li&gt;
        &lt;b&gt;Broadcast Mode:&lt;/b&gt;
        use the parameters &lt;em&gt;Maximum Loudness Range (MaxLRA)&lt;/em&gt;,
        &lt;em&gt;Maximum Short-term Loudness (MaxS)&lt;/em&gt;, and &lt;em&gt;Maximum Momentary Loudness (MaxM)&lt;/em&gt;
        to control the leveling strength
    &lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;
    All new settings are also &lt;b&gt;available in our API&lt;/b&gt;, please see
    &lt;a href="https://auphonic.com/help/api/details.html#advanced-algorithm-parameters" target="_blank"&gt;Singletrack&lt;/a&gt; and
    &lt;a href="https://auphonic.com/help/api/multitrack.html#multitrack-advanced-audio-algorithm-settings" target="_blank"&gt;Multitrack Advanced API Settings&lt;/a&gt;.
&lt;/p&gt;
&lt;p&gt;
    Don't hesitate to
    &lt;a href="https://auphonic.com/contact" target="_blank"&gt;contact us&lt;/a&gt;
    if you have any questions or feedback about our algorithms!
&lt;/p&gt;

&lt;br&gt;
&lt;br&gt;
&lt;br&gt;
</description><author>grh@auphonic.com (Georg)</author><pubDate>Fri, 04 Dec 2020 08:24:11 +0000</pubDate><guid>https://auphonic.com/blog/2020/12/04/advanced-leveler-broadcast-parameters/</guid><enclosure length="100000" type="image/jpeg" url="http://auphonic.com/media/blog/broadcast-leveler-img.jpg"/><category>Audio</category><category>Development</category><category>News</category></item><item><title>Advanced Multitrack Audio Algorithms Release (Beta)
</title><link>https://auphonic.com/blog/2019/03/29/advanced-multitrack-algorithms-beta/</link><description>
&lt;p&gt;
Last weekend, at the
&lt;a href="https://das-sendezentrum.de/subscribe/sub10" target="_blank"&gt;Subscribe10&lt;/a&gt; conference,
we released &lt;em&gt;Advanced Audio Algorithm Parameters&lt;/em&gt; for
&lt;a href="https://auphonic.com/help/web/multitrack.html" target="_blank"&gt;Multitrack Productions&lt;/a&gt;:
&lt;/p&gt;

&lt;a href="https://auphonic.com/media/blog/mt-advanced-parameters.png"&gt;
&lt;img src="https://auphonic.com/media/blog/mt-advanced-parameters.png" width="590px" height="232px" style="border: 1px solid #ddd;"&gt;
&lt;/a&gt;

&lt;p&gt;
We launched our advanced audio algorithm parameters for
&lt;a href="https://auphonic.com/help/web/production.html" target="_blank"&gt;Singletrack Productions&lt;/a&gt; last year.
Now these settings (and more) are available for
&lt;a href="https://auphonic.com/help/algorithms/multitrack.html" target="_blank"&gt;Multitrack Algorithms&lt;/a&gt; as well,
which gives you detailed control for each track of your production.
&lt;/p&gt;

The following new parameters are available:
&lt;ul&gt;
&lt;li&gt;&lt;a href="#forebackground"&gt;Fore/Background Settings:&lt;/a&gt; keep your music/clip tracks unchanged and set a custom background gain&lt;/li&gt;
&lt;li&gt;&lt;a href="#leveler"&gt;Multitrack Leveler Parameters:&lt;/a&gt; control the stereo panorama, leveling algorithm, dynamic range and compression&lt;/li&gt;
&lt;li&gt;&lt;a href="#denoise"&gt;Better Hum and Noise Reduction Controls&lt;/a&gt; for each track&lt;/li&gt;
&lt;li&gt;&lt;a href="#maxpeak"&gt;Maximum True Peak Level&lt;/a&gt; setting for the final mixdown&lt;/li&gt;
&lt;li&gt;&lt;a href="#api"&gt;Full API Support&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;
Please join our &lt;a href="#beta"&gt;private beta program&lt;/a&gt; and let us know how you use these new features or if you need even more control!
&lt;/p&gt;


&lt;h3 id="forebackground"&gt;Fore/Background Settings&lt;/h3&gt;

&lt;a href="https://auphonic.com/media/blog/mt-backforeground.png"&gt;
&lt;img src="https://auphonic.com/media/blog/mt-backforeground.png" width="584px"&gt;
&lt;/a&gt;

&lt;p&gt;
The parameter Fore/Background controls whether a track should be in &lt;em&gt;foreground&lt;/em&gt;, in &lt;em&gt;background&lt;/em&gt;, &lt;em&gt;ducked&lt;/em&gt;, or &lt;em&gt;unchanged&lt;/em&gt;, which is especially important for music or clip tracks. &lt;br&gt;
For more details, please see
&lt;a href="https://auphonic.com/help/algorithms/multitrack.html#automatic-ducking-foreground-and-background-tracks" target="_blank"&gt;
Automatic Ducking, Foreground and Background Tracks
&lt;/a&gt;.
&lt;/p&gt;

We now added the new option &lt;em&gt;Unchanged&lt;/em&gt; and a new parameter to set the level of background segments/tracks:
&lt;dl&gt;

&lt;dt&gt;Unchanged (Foreground):&lt;/dt&gt;
&lt;dd&gt;
We sometimes received complaints from users, which produced very complex music or clip tracks, that Auphonic
changes the levels too hard.&lt;br&gt;
If you set the parameter Fore/Background to the new option &lt;em&gt;Unchanged (Foreground)&lt;/em&gt;, &lt;b&gt;Level relations within this track won’t be changed at all&lt;/b&gt;.
It will be added to the final mixdown so that foreground/solo parts of this track will be as loud as (foreground) speech from other tracks.
&lt;/dd&gt;

&lt;dt&gt;Background Level:&lt;/dt&gt;
&lt;dd&gt;
It is now possible to set the level of background segments/tracks (compared to foreground segments) in background and ducking tracks. 
By default, background and ducking segments are 18dB softer than foreground segments.
&lt;/dd&gt;

&lt;/dl&gt;


&lt;h3 id="leveler"&gt;Leveler Parameters&lt;/h3&gt;

&lt;a href="https://auphonic.com/media/blog/mt-advanced-leveler.png"&gt;
&lt;img src="https://auphonic.com/media/blog/mt-advanced-leveler.png" width="598px"&gt;
&lt;/a&gt;

&lt;p&gt;
Similar to our &lt;em&gt;Singletrack Advanced Leveler Parameters&lt;/em&gt;
(see this
&lt;a href="https://auphonic.com/blog/2018/11/05/auphonic-leveler-customization/" target="_blank"&gt;previous blog post&lt;/a&gt;),
we also released leveling parameters for Multitrack Productions now.
&lt;br&gt;
The following advanced parameters for our
&lt;a href="https://auphonic.com/help/algorithms/multitrack.html#multitrack-adaptive-leveler" target="_blank"&gt;Multitrack Adaptive Leveler&lt;/a&gt;
can be set for each track and allow you to customize &lt;b&gt;which parts&lt;/b&gt; of the audio should be leveled, &lt;b&gt;how much&lt;/b&gt; they should be leveled, how much &lt;b&gt;dynamic range compression&lt;/b&gt; should be applied and to set the &lt;b&gt;stereo panorama&lt;/b&gt; (balance):
&lt;/p&gt;
&lt;dl&gt;

&lt;dt&gt;Leveler Preset:&lt;/dt&gt;
&lt;dd&gt;
Select the &lt;em&gt;Speech&lt;/em&gt; or &lt;em&gt;Music Leveler&lt;/em&gt; for this track. &lt;br&gt;
If set to &lt;em&gt;Automatic&lt;/em&gt; (default), a classifier will decide if this is a music or speech track.
&lt;/dd&gt;

&lt;dt&gt;Dynamic Range:&lt;/dt&gt;
&lt;dd&gt;
The parameter &lt;em&gt;Dynamic Range&lt;/em&gt; controls &lt;b&gt;how much leveling&lt;/b&gt; is applied: Higher values result in more dynamic output audio files (less leveling). If you want to increase the dynamic range by 3dB (or LU), just increase the &lt;em&gt;Dynamic Range&lt;/em&gt; parameter by 3dB. &lt;br&gt;
For more details, please see 
&lt;a href="https://auphonic.com/help/web/multitrack.html#leveler-parameters" target="_blank"&gt;Multitrack Leveler Parameters&lt;/a&gt;.
&lt;/dd&gt;

&lt;dt&gt;Compressor:&lt;/dt&gt;
&lt;dd&gt;
Select a preset for &lt;b&gt;Micro-Dynamics Compression&lt;/b&gt;: &lt;em&gt;Auto&lt;/em&gt;, &lt;em&gt;Soft&lt;/em&gt;, &lt;em&gt;Medium&lt;/em&gt;, &lt;em&gt;Hard&lt;/em&gt; or &lt;em&gt;Off&lt;/em&gt;. &lt;br&gt;
The &lt;em&gt;Compressor&lt;/em&gt; adjusts
&lt;a href="https://auphonic.com/blog/2011/07/25/loudness-normalization-and-compression-podcasts-and-speech-audio/#compressor" target="_blank"&gt;short-term dynamics&lt;/a&gt;, whereas the Leveler adjusts
&lt;a href="https://auphonic.com/blog/2011/07/25/loudness-normalization-and-compression-podcasts-and-speech-audio/#mid-term" target="_blank"&gt;mid-term level differences&lt;/a&gt;. &lt;br&gt;
For more details, please see 
&lt;a href="https://auphonic.com/help/web/multitrack.html#leveler-parameters" target="_blank"&gt;Multitrack Leveler Parameters&lt;/a&gt;.
&lt;/dd&gt;

&lt;dt&gt;Stereo Panorama (Balance):&lt;/dt&gt;
&lt;dd&gt;
Change the stereo panorama (balance for stereo input files) of the current track. &lt;br&gt;
Possible values: &lt;em&gt;L100&lt;/em&gt;, &lt;em&gt;L75&lt;/em&gt;, &lt;em&gt;L50&lt;/em&gt;, &lt;em&gt;L25&lt;/em&gt;, &lt;em&gt;Center&lt;/em&gt;, &lt;em&gt;R25&lt;/em&gt;, &lt;em&gt;R50&lt;/em&gt;, &lt;em&gt;R75&lt;/em&gt; and &lt;em&gt;R100&lt;/em&gt;.
&lt;/dd&gt;

&lt;/dl&gt;

&lt;p&gt;
If you understand German and want to know more about our Advanced Leveler Parameters and audio dynamics in general, watch our talk at the
&lt;a href="https://das-sendezentrum.de/subscribe/sub10" target="_blank"&gt;Subscribe10&lt;/a&gt; conference: &lt;br&gt;
&lt;b&gt;&lt;a href="https://www.youtube.com/watch?v=dUN8X8pY91I" target="_blank"&gt;Video: Audio Lautheit und Dynamik&lt;/a&gt;&lt;/b&gt;.
&lt;/p&gt;


&lt;h3 id="denoise"&gt;Better Hum and Noise Reduction Controls&lt;/h3&gt;

&lt;a href="https://auphonic.com/media/blog/mt-advanced-denoise.png"&gt;
&lt;img src="https://auphonic.com/media/blog/mt-advanced-denoise.png" width="558px"&gt;
&lt;/a&gt;

We now offer three parameters to control the combination of our
&lt;a href="https://auphonic.com/help/algorithms/multitrack.html#multitrack-noise-and-hum-reduction" target="_blank"&gt;
Multitrack Noise and Hum Reduction Algorithms
&lt;/a&gt;
for each input track:

&lt;dl&gt;

&lt;dt&gt;Noise Reduction Amount:&lt;/dt&gt;
&lt;dd&gt;
Maximum noise and hum reduction amount in dB, higher values remove more noise. &lt;br&gt;
In &lt;em&gt;Auto&lt;/em&gt; mode, a classifier decides if and how much noise reduction is necessary (to avoid artifacts). 
Set to a custom (non-Auto) value if you prefer more noise reduction or want to bypass our classifier.
&lt;/dd&gt;

&lt;dt&gt;Hum Base Frequency:&lt;/dt&gt;
&lt;dd&gt;
Set the hum base frequency to 50Hz or 60Hz (if you know it), or use &lt;em&gt;Auto&lt;/em&gt; to automatically detect the hum base frequency in each speech region.
&lt;/dd&gt;

&lt;dt&gt;Hum Reduction Amount:&lt;/dt&gt;
&lt;dd&gt;
Maximum hum reduction amount in dB, higher values remove more noise. &lt;br&gt;
In &lt;em&gt;Auto&lt;/em&gt; mode, a classifier decides how much hum reduction is necessary in each speech region.
Set it to a custom value (&gt; 0), if you prefer more hum reduction or want to bypass our classifier.
Use &lt;em&gt;Disable Dehum&lt;/em&gt; to disable hum reduction and use our noise reduction algorithms only.
&lt;/dd&gt;

&lt;/dl&gt;

&lt;p&gt;
Behavior of noise and hum reduction parameter combinations:
&lt;/p&gt;

&lt;table style="width:100%; border: 1px solid #e1e4e5;"&gt;
&lt;tr style="border: 1px solid #e1e4e5;"&gt;
	&lt;th&gt;Noise Reduction Amount&lt;/th&gt;
	&lt;th&gt;Hum Base Frequency&lt;/th&gt;
	&lt;th&gt;Hum Reduction Amount&lt;/th&gt;
	&lt;th&gt;&lt;/th&gt;
&lt;/tr&gt;
&lt;tr style="border: 1px solid #e1e4e5; background-color: #f3f6f6;
"&gt;
	&lt;td&gt;Auto&lt;/td&gt;
	&lt;td&gt;Auto&lt;/td&gt;
	&lt;td&gt;Auto&lt;/td&gt;
	&lt;td&gt;Automatic hum and noise reduction&lt;/td&gt;
&lt;/tr&gt;
&lt;tr style="border: 1px solid #e1e4e5;"&gt;
	&lt;td&gt;Auto or &gt; 0&lt;/td&gt;
	&lt;td&gt;*&lt;/td&gt;
	&lt;td&gt;Disabled&lt;/td&gt;
	&lt;td&gt;No hum reduction, only denoise&lt;/td&gt;
&lt;/tr&gt;
&lt;tr style="border: 1px solid #e1e4e5; background-color: #f3f6f6;
"&gt;
	&lt;td&gt;Disabled&lt;/td&gt;
	&lt;td&gt;50Hz&lt;/td&gt;
	&lt;td&gt;Auto or &gt; 0&lt;/td&gt;
	&lt;td&gt;Force 50Hz hum reduction, no denoise&lt;/td&gt;
&lt;/tr&gt;
&lt;tr style="border: 1px solid #e1e4e5;"&gt;
	&lt;td&gt;Disabled&lt;/td&gt;
	&lt;td&gt;Auto&lt;/td&gt;
	&lt;td&gt;Auto or &gt; 0&lt;/td&gt;
	&lt;td&gt;Automatic dehum, no denoise&lt;/td&gt;
&lt;/tr&gt;
&lt;tr style="border: 1px solid #e1e4e5; background-color: #f3f6f6;
"&gt;
	&lt;td&gt;12dB&lt;/td&gt;
	&lt;td&gt;60Hz&lt;/td&gt;
	&lt;td&gt;Auto or &gt; 0&lt;/td&gt;
	&lt;td&gt;Always do dehum (60Hz) and denoise (12dB)&lt;/td&gt;
&lt;/tr&gt;
&lt;/table&gt;


&lt;h3 id="maxpeak"&gt;Maximum True Peak Level&lt;/h3&gt;

&lt;a href="https://auphonic.com/media/blog/mt-maxpeak.png"&gt;
&lt;img src="https://auphonic.com/media/blog/mt-maxpeak.png" width="658px"&gt;
&lt;/a&gt;

&lt;p&gt;
In the
&lt;a href="https://auphonic.com/help/web/multitrack.html#master-audio-algorithms" target="_blank"&gt;Master Algorithm Settings
&lt;/a&gt; of your multitrack production, you can
set the maximum allowed true peak level of the processed output file,
which is controlled by the &lt;b&gt;True Peak Limiter&lt;/b&gt; after our
&lt;a href="https://auphonic.com/help/algorithms/multitrack.html#loudness-normalization-and-true-peak-limiter" target="_blank"&gt;Loudness Normalization
&lt;/a&gt;
algorithms.
&lt;/p&gt;

&lt;p&gt;
If set to &lt;em&gt;Auto&lt;/em&gt; (which is the current default), a reasonable value according to the selected loudness target is used:
-1dBTP for 23 LUFS (EBU R128) and higher, -2dBTP for -24 LUFS (ATSC A/85) and lower loudness targets.
&lt;/p&gt;


&lt;h3 id="api"&gt;Full API Support&lt;/h3&gt;

&lt;p&gt;
All advanced algorithm parameters, for
&lt;a href="https://auphonic.com/help/web/production.html" target="_blank"&gt;Singletrack&lt;/a&gt; and
&lt;a href="https://auphonic.com/help/web/multitrack.html" target="_blank"&gt;Multitrack&lt;/a&gt; Productions,
are available in our
&lt;a href="https://auphonic.com/help/api/index.html" target="_blank"&gt;API&lt;/a&gt;
as well,
which allows you to integrate them into your scripts, external workflows and third-party applications.
&lt;/p&gt;

&lt;dt&gt;Singletrack API:&lt;/dt&gt;
&lt;dd&gt;
Documentation on how to use the advanced algorithm parameters in our singletrack production API:
&lt;a href="https://auphonic.com/help/api/details.html#advanced-algorithm-parameters" target="_blank"&gt;Advanced Algorithm Parameters&lt;/a&gt;
&lt;/dd&gt;

&lt;dt&gt;Multitrack API:&lt;/dt&gt;
&lt;dd&gt;
Documentation of advanced settings for each track of a multitrack production: &lt;br&gt;
&lt;a href="https://auphonic.com/help/api/multitrack.html#multitrack-advanced-audio-algorithm-settings" target="_blank"&gt;Multitrack Advanced Audio Algorithm Settings&lt;/a&gt;
&lt;/dd&gt;

&lt;/dl&gt;


&lt;h3 id="beta"&gt;Join the Beta and Send Feedback&lt;/h3&gt;

&lt;p&gt;
Please join our beta and &lt;a href="https://auphonic.com/contact" target="_blank"&gt;let us know&lt;/a&gt; your case studies, if you need
any other algorithm parameters or if you have any questions!
&lt;/p&gt;

&lt;p&gt;
Here are some private beta invitation codes:
&lt;pre&gt;
8tZPc3T9pH VAvO8VsDg9 0TwKXBW4Ni kjXJMivtZ1 J9APmAAYjT Zwm6HabuFw HNK5gF8FR5 Do1MPHUyPW CTk45VbV4t xYOzDkEnWP
9XE4dZ0FxD 0Sl3PxDRho uSoRQxmKPx TCI62OjEYu 6EQaPYs7v4 reIJVOwIr8 7hPJqZmWfw kti3m5KbNE GoM2nF0AcN xHCbDC37O5
6PabLBRm9P j2SoI8peiY olQ2vsmnfV fqfxX4mWLO OozsiA8DWo weJw0PXDky VTnOfOiL6l B6HRr6gil0 so0AvM1Ryy NpPYsInFqm
oFeQPLwG0k HmCOkyaX9R G7DR5Sc9Kv MeQLSUCkge xCSvPTrTgl jyQKG3BWWA HCzWRxSrgW xP15hYKEDl 241gK62TrO Q56DHjT3r4
9TqWVZHZLE aWFMSWcuX8 x6FR5OTL43 Xf6tRpyP4S tDGbOUngU0 5BkOF2I264 cccHS0KveO dT29cF75gG 2ySWlYp1kp iJWPhpAimF
&lt;/pre&gt;
We are happy to send further &lt;b&gt;invitation codes&lt;/b&gt; to all interested users - please do not hesitate to
&lt;a href="https://auphonic.com/contact" target="_blank"&gt;contact us&lt;/a&gt;!
&lt;/p&gt;

&lt;p&gt;
If you have an invitation code, you can enter it here to activate the Multitrack Advanced Audio Algorithm Parameters:
&lt;br&gt;
&lt;b style="text-align: center; display: block;"&gt;&lt;a href="https://auphonic.com/private_beta" target="_blank"&gt;Auphonic Algorithm Parameters Private Beta Activation&lt;/a&gt;&lt;/b&gt;
&lt;/p&gt;

&lt;br&gt;
&lt;br&gt;
&lt;br&gt;
</description><author>grh@auphonic.com (Georg)</author><pubDate>Fri, 29 Mar 2019 10:16:41 +0000</pubDate><guid>https://auphonic.com/blog/2019/03/29/advanced-multitrack-algorithms-beta/</guid><enclosure length="100000" type="image/png" url="http://auphonic.com/media/blog/mt-advanced-parameters.png"/><category>Audio</category><category>Development</category><category>News</category></item><item><title>More Languages for Amazon Transcribe Speech Recognition
</title><link>https://auphonic.com/blog/2019/01/31/more-languages-for-amazon-transcribe/</link><description>
&lt;p&gt;
Until recently,
&lt;a href="https://aws.amazon.com/de/transcribe/" target="_blank"&gt;Amazon Transcribe&lt;/a&gt;
supported speech recognition in &lt;b&gt;English and Spanish&lt;/b&gt; only. &lt;br&gt;
Now they included &lt;b&gt;French, Italian and Portuguese&lt;/b&gt; as well - and a few other languages (including 
&lt;a href=https://docs.aws.amazon.com/de_de/transcribe/latest/dg/API_StartTranscriptionJob.html#transcribe-StartTranscriptionJob-request-LanguageCode"" target="_blank"&gt;German&lt;/a&gt;)
are in private beta.
&lt;/p&gt;

&lt;p&gt;
&lt;b&gt;Update March 2019:&lt;/b&gt; &lt;br&gt;
Now Amazon Transcribe supports &lt;b&gt;German&lt;/b&gt; and &lt;b&gt;Korean&lt;/b&gt; as well.
&lt;/p&gt;

&lt;p style="text-align: center; font-size: small;"&gt;
&lt;a href="https://auphonic.com/static/screenshots/inspector-mt-closed.png" target="_blank"&gt;&lt;img style="width: 100%; max-width: 955px;" alt="https://auphonic.com/static/screenshots/inspector-mt-closed.png" src="https://auphonic.com/static/screenshots/inspector-mt-closed.png" /&gt;&lt;/a&gt;
The Auphonic &lt;a href="https://auphonic.com/help/web/audioinspector.html" target="_blank"&gt;Audio Inspector&lt;/a&gt; on the status page
of a finished &lt;a target="_blank" href="https://auphonic.com/help/web/multitrack.html"&gt;Multitrack Production&lt;/a&gt; including speech recognition. &lt;br&gt;
Please click on the screenshot to see it in full resolution!
&lt;/p&gt;

&lt;br&gt;
&lt;p&gt;
Amazon Transcribe is integrated as
&lt;a href="https://auphonic.com/help/algorithms/speech_recognition.html" target="_blank"&gt;speech recognition engine&lt;/a&gt;
within Auphonic and offers &lt;b&gt;accurate transcriptions&lt;/b&gt;
(compared to
&lt;a href="https://auphonic.com/help/algorithms/speech_recognition.html#services-comparison-table" target="_blank"&gt;other services&lt;/a&gt;)
at &lt;b&gt;low costs&lt;/b&gt;, including keywords / custom vocabulary support, word confidence, timestamps, and punctuation. &lt;br&gt;
See the following AWS blog post and video for more information about recent Amazon Transcribe developments:
&lt;a href="https://aws.amazon.com/fr/blogs/machine-learning/transcribe-speech-in-three-new-languages-french-italian-and-brazilian-portuguese/" target="_blank"&gt;Transcribe speech in three new languages: French, Italian, and Brazilian Portuguese&lt;/a&gt;.
&lt;/p&gt;

&lt;p&gt;
Amazon Transcribe is also a perfect fit if you want to use our
&lt;a href="https://auphonic.com/help/algorithms/speech_recognition.html#auphonic-transcript-editor" target="_blank"&gt;Transcript Editor&lt;/a&gt;
because you will be able to see word timestamps and
&lt;a href="https://auphonic.com/help/algorithms/speech_recognition.html#word-confidence-highlighting" target="_blank"&gt;confidence values&lt;/a&gt;
to instantly &lt;b&gt;check which section/words should be corrected manually&lt;/b&gt; to increase the transcription accuracy:
&lt;/p&gt;

&lt;p style="text-align: center; font-size: small;"&gt;
&lt;img width="635" height="296" src="https://auphonic.com/media/blog/TranscriptEditor.png" /&gt;&lt;br&gt;
Screenshot of our &lt;a href="https://auphonic.com/help/algorithms/speech_recognition.html#auphonic-transcript-editor" target="_blank"&gt;Transcript Editor&lt;/a&gt; with word confidence highlighting and the edit bar.
&lt;/p&gt;

&lt;p&gt;
These features are also available if you use
&lt;a href="https://auphonic.com/help/algorithms/speech_recognition.html#speechmatics" target="_blank"&gt;Speechmatics&lt;/a&gt;,
but unfortunately not in our other
&lt;a href="https://auphonic.com/help/algorithms/speech_recognition.html#integrated-speech-recognition-services" target="_blank"&gt;integrated speech recognition services&lt;/a&gt;.
&lt;/p&gt;


&lt;h3&gt;About Speech Recognition within Auphonic&lt;/h3&gt;

&lt;p&gt;
Auphonic has built a layer on top of a few external
&lt;a href="https://auphonic.com/help/algorithms/speech_recognition.html#integrated-speech-recognition-services" target="_blank"&gt;speech recognition services&lt;/a&gt;
to &lt;b&gt;make audio searchable&lt;/b&gt;: &lt;br&gt;
Our classifiers generate metadata during the analysis of an audio signal (music segments, silence, multiple speakers, etc.) to divide the audio file into small and meaningful segments, which are processed by the speech recognition engine.
The results from all segments are then combined, and meaningful timestamps, simple punctuation and structuring are added to the resulting text.
&lt;/p&gt;
&lt;p&gt;
To &lt;b&gt;learn more&lt;/b&gt; about speech recognition within Auphonic, take a look at our
&lt;a href="https://auphonic.com/help/algorithms/speech_recognition.html" target="_blank"&gt;Speech Recognition&lt;/a&gt;
and &lt;a href="https://auphonic.com/help/algorithms/speech_recognition.html#auphonic-transcript-editor" target="_blank"&gt;Transcript Editor&lt;/a&gt;
help pages or listen to our
&lt;a href="https://auphonic.com/audio_examples#speechrec" target="_blank"&gt;Speech Recognition Audio Examples&lt;/a&gt;.
&lt;/p&gt;
&lt;p&gt;
A &lt;b&gt;comparison table&lt;/b&gt; of our integrated services
(price, quality, languages, speed, features, etc.)
can be found here:
&lt;a href="https://auphonic.com/help/algorithms/speech_recognition.html#services-comparison-table" target="_blank"&gt;Speech Recognition Services Comparison&lt;/a&gt;.
&lt;/p&gt;


&lt;h3&gt;Conclusion&lt;/h3&gt;

&lt;p&gt;
We hope that Amazon and others will continue to add new languages, to get accurate and
inexpensive automatic speech recognition in many languages.
&lt;/p&gt;

&lt;p&gt;
Don't hesitate to &lt;a href="https://auphonic.com/contact" target="_blank"&gt;contact us&lt;/a&gt; if you have any questions or feedback about speech recognition
or our transcript editor!
&lt;/p&gt;

&lt;br&gt;
&lt;br&gt;
</description><author>grh@auphonic.com (Georg)</author><pubDate>Thu, 31 Jan 2019 10:30:26 +0000</pubDate><guid>https://auphonic.com/blog/2019/01/31/more-languages-for-amazon-transcribe/</guid><enclosure length="100000" type="image/png" url="http://auphonic.com/static/screenshots/inspector-mt-closed.png"/><category>Development</category><category>News</category></item><item><title>Auphonic Adaptive Leveler Customization (Beta Update)
</title><link>https://auphonic.com/blog/2018/11/05/auphonic-leveler-customization/</link><description>
&lt;p&gt;
In late August,
&lt;a href="https://auphonic.com/blog/2018/08/21/leveler-presets-lra-target-advanced-audio-params-beta/" target="_blank"&gt;we launched the private beta program&lt;/a&gt;
of our &lt;b&gt;advanced audio algorithm parameters&lt;/b&gt;.
After feedback by our users and many new experiments, we are proud to release a complete rework of the
&lt;a href="https://auphonic.com/help/algorithms/singletrack.html#adaptive-leveler" target="_blank"&gt;Adaptive Leveler&lt;/a&gt; parameters:
&lt;/p&gt;

&lt;a href="https://auphonic.com/media/blog/AdvancedAlgsUpdate.png"&gt;
&lt;img src="https://auphonic.com/media/blog/AdvancedAlgsUpdate.png" width="585px" style="border: 1px solid #ddd;"&gt;
&lt;/a&gt;

&lt;p&gt;
In the
&lt;a href="#https://auphonic.com/blog/2018/08/21/leveler-presets-lra-target-advanced-audio-params-beta/"&gt;previous version&lt;/a&gt;,
we based our &lt;a href="https://auphonic.com/help/algorithms/singletrack.html#adaptive-leveler" target="_blank"&gt;Adaptive Leveler&lt;/a&gt; parameters on the
&lt;a href="https://auphonic.com/blog/2012/08/02/loudness-measurement-and-normalization-ebu-r128-calm-act/#lra" target="_blank"&gt;Loudness Range&lt;/a&gt; descriptor (LRA),
which is included in the
&lt;a href="http://tech.ebu.ch/webdav/site/tech/shared/tech/tech3342.pdf" target="_blank"&gt;EBU R128 specification&lt;/a&gt;. &lt;br&gt;
Although it worked, it turned out that it is very difficult to set a loudness range target for diverse audio content, which does include speech, background sounds, music parts, etc.
The results were not predictable and it was hard to find good target values. &lt;br&gt;
Therefore we developed our own algorithm to measure the
&lt;b&gt;dynamic range&lt;/b&gt; of audio signals, which works similarly for speech, music and other audio content.
&lt;/p&gt;


&lt;p&gt;
The following advanced parameters for our
&lt;a href="https://auphonic.com/help/algorithms/singletrack.html#adaptive-leveler" target="_blank"&gt;Adaptive Leveler&lt;/a&gt;
allow you to customize &lt;b&gt;which parts&lt;/b&gt; of the audio should be leveled (foreground, all, speech, music, etc.), &lt;b&gt;how much&lt;/b&gt; they should be leveled (dynamic range), and &lt;b&gt;how much micro-dynamics compression&lt;/b&gt; should be applied.
&lt;/p&gt;

&lt;p&gt;
To try out the new algorithms,
please join our &lt;a href="#beta"&gt;private beta program&lt;/a&gt; and let us know your feedback!
&lt;/p&gt;


&lt;h3&gt;Leveler Preset&lt;/h3&gt;

&lt;p&gt;
The &lt;em&gt;Leveler Preset&lt;/em&gt; defines which parts of the audio should be adjusted by our
&lt;a href="https://auphonic.com/help/algorithms/singletrack.html#adaptive-leveler" target="_blank"&gt;Adaptive Leveler&lt;/a&gt;:
&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
    &lt;b&gt;Default Leveler:&lt;/b&gt;&lt;br&gt;
    Our classic, default leveling algorithm as demonstrated in the
    &lt;a href="https://auphonic.com/audio_examples#leveler" target="_blank"&gt;Leveler Audio Examples&lt;/a&gt;.
    Use it if you are unsure.
&lt;/li&gt;
&lt;li&gt;
    &lt;b&gt;Foreground Only Leveler:&lt;/b&gt;&lt;br&gt;
    This preset reacts slower and levels foreground parts only.
    Use it if you have background speech or background music, which should not be amplified.
&lt;/li&gt;
&lt;li&gt;
    &lt;b&gt;Fast Leveler:&lt;/b&gt;&lt;br&gt;
    A preset which reacts much faster.
    It is built for recordings with fast and extreme loudness differences, for example, to amplify very quiet questions
    from the audience in a lecture recording, to balance fast-changing soft and loud voices within one audio track, etc.
&lt;/li&gt;
&lt;li&gt;
    &lt;b&gt;Amplify Everything:&lt;/b&gt;&lt;br&gt;
    Amplify as much as possible.
    Similar to the &lt;em&gt;Fast Leveler&lt;/em&gt;, but also amplifies non-speech background sounds like noise.
&lt;/li&gt;
&lt;/ul&gt;


&lt;h3&gt;Leveler Dynamic Range&lt;/h3&gt;

&lt;p&gt;
Our default &lt;a href="https://auphonic.com/help/algorithms/singletrack.html#adaptive-leveler" target="_blank"&gt;Leveler&lt;/a&gt;
tries to normalize all speakers to a similar loudness
so that a consumer in a car or subway doesn't feel the need to reach for the volume control. &lt;br&gt;
However, in other environments (living room, cinema, etc.) or in dynamic recordings, you might want &lt;b&gt;more level differences&lt;/b&gt; (Dynamic Range,
&lt;a href="https://auphonic.com/blog/2012/08/02/loudness-measurement-and-normalization-ebu-r128-calm-act/#lra" target="_blank"&gt;Loudness Range / LRA&lt;/a&gt;)
between speakers and within music segments.
&lt;/p&gt;
&lt;p&gt;
The parameter &lt;em&gt;Dynamic Range&lt;/em&gt; controls &lt;b&gt;how much leveling&lt;/b&gt; is applied:
Higher values result in more dynamic output audio files (less leveling). If you want to increase the dynamic range
by 3dB (or LU), just increase the &lt;em&gt;Dynamic Range&lt;/em&gt; parameter by 3dB.
&lt;br&gt;
We also like to call this &lt;b&gt;Loudness Comfort Zone&lt;/b&gt;: above a maximum and below a minimum possible level (the comfort zone), no leveling is applied. So if your input file already has a small dynamic range (is within the comfort zone), our leveler will be just bypassed.
&lt;/p&gt;
&lt;p&gt;
&lt;b&gt;Example Use Cases:&lt;/b&gt; &lt;br&gt;
Higher dynamic range values should be used if you want to keep more loudness differences in dynamic narration
or dynamic music recordings (live concert/classical). &lt;br&gt;
It is also possible to utilize this parameter to &lt;b&gt;generate automatic mixdowns&lt;/b&gt; with different loudness range (LRA) values for different target environments
(very compressed ones like mobile devices or Alexa, very dynamic ones like home cinema, etc.).
&lt;/p&gt;


&lt;h3&gt;Compressor&lt;/h3&gt;

&lt;p&gt;
Controls &lt;b&gt;Micro-Dynamics Compression&lt;/b&gt;: &lt;br&gt;
The compressor reduces the volume of short and loud spikes like "p", "t" or laughter
(&lt;a href="https://auphonic.com/blog/2011/07/25/loudness-normalization-and-compression-podcasts-and-speech-audio/#compressor" target="_blank"&gt;
short-term dynamics&lt;/a&gt;) and also &lt;b&gt;shapes the sound&lt;/b&gt; of your voice (it will sound more or less "processed"). &lt;br&gt;
The Leveler, on the other hand, adjusts
&lt;a href="https://auphonic.com/blog/2011/07/25/loudness-normalization-and-compression-podcasts-and-speech-audio/#mid-term" target="_blank"&gt;
mid-term level differences&lt;/a&gt;, as done by a sound engineer, using the faders of an audio mixer,
so that a listener doesn't have to adjust the playback volume all the time. &lt;br&gt;
For more details please see
&lt;a href="https://auphonic.com/blog/2011/07/25/loudness-normalization-and-compression-podcasts-and-speech-audio/" target="_blank"&gt;
Loudness Normalization and Compression of Podcasts and Speech Audio&lt;/a&gt;.
&lt;/p&gt;

Possible values are:
&lt;ul&gt;
&lt;li&gt;&lt;b&gt;Auto:&lt;/b&gt;&lt;br&gt;
The compressor setting depends on the selected &lt;em&gt;Leveler Preset&lt;/em&gt;.
&lt;em&gt;Medium&lt;/em&gt; compression is used in &lt;em&gt;Foreground Only&lt;/em&gt; and &lt;em&gt;Default Leveler&lt;/em&gt; presets,
&lt;em&gt;Hard&lt;/em&gt; compression in our &lt;em&gt;Fast Leveler&lt;/em&gt; and &lt;em&gt;Amplify Everything&lt;/em&gt; presets.
&lt;/li&gt;
&lt;li&gt;&lt;b&gt;Soft:&lt;/b&gt;&lt;br&gt; Uses less compression.&lt;/li&gt;
&lt;li&gt;&lt;b&gt;Medium:&lt;/b&gt;&lt;br&gt; Our default setting.&lt;/li&gt;
&lt;li&gt;&lt;b&gt;Hard:&lt;/b&gt;&lt;br&gt;
More compression, especially tries to compress short and extreme level overshoots.
Use this preset if you want your voice to sound very processed, our if you have extreme and fast-changing level differences.
&lt;/li&gt;
&lt;li&gt;&lt;b&gt;Off:&lt;/b&gt;&lt;br&gt;
No &lt;a href="https://auphonic.com/blog/2011/07/25/loudness-normalization-and-compression-podcasts-and-speech-audio/#compressor" target="_blank"&gt;
short-term dynamics compression&lt;/a&gt; is used at all, only
&lt;a href="https://auphonic.com/blog/2011/07/25/loudness-normalization-and-compression-podcasts-and-speech-audio/#mid-term" target="_blank"&gt;
mid-term leveling&lt;/a&gt;. Switch off the compressor if you just want to adjust the loudness range without any additional
micro-dynamics compression.
&lt;/li&gt;
&lt;/ul&gt;


&lt;h3&gt;Separate Music/Speech Parameters&lt;/h3&gt;

&lt;p&gt;
Use the switch &lt;em&gt;Separate MusicSpeech Parameters&lt;/em&gt; (top right), to see separate
&lt;a href="https://auphonic.com/help/algorithms/singletrack.html#adaptive-leveler" target="_blank"&gt;Adaptive Leveler&lt;/a&gt;
parameters for music and speech segments,
to  control all leveling details separately for speech and music parts:
&lt;/p&gt;

&lt;a href="https://auphonic.com/media/blog/LevelerMSParams.png"&gt;
&lt;img src="https://auphonic.com/media/blog/LevelerMSParams.png" width=600px"&gt;
&lt;/a&gt;

&lt;p&gt;
For &lt;b&gt;dialog intelligibility improvements in films and TV&lt;/b&gt;, it is important that the speech/dialog level and loudness range is not too soft compared to the overall
&lt;a href="https://auphonic.com/blog/2012/08/02/loudness-measurement-and-normalization-ebu-r128-calm-act/" target="_blank"&gt;programme level&lt;/a&gt; and
&lt;a href="https://auphonic.com/blog/2012/08/02/loudness-measurement-and-normalization-ebu-r128-calm-act/#lra" target="_blank"&gt;loudness range&lt;/a&gt;.
This parameter allows you to use more leveling in speech parts while keeping music and FX elements less processed. &lt;br&gt;
&lt;b&gt;Note:&lt;/b&gt; Speech, music and overall loudness and loudness range of your production are also displayed in our
&lt;a href="https://auphonic.com/help/web/audioinspector.html#audio-processing-statistics" target="_blank"&gt;Audio Processing Statistics&lt;/a&gt;!
&lt;/p&gt;

&lt;p&gt;
&lt;b&gt;Example Use Case:&lt;/b&gt; &lt;br&gt;
Music live recordings or dynamic music mixes, where you want to amplify
all speakers (speech dynamic range should be small) but keep the dynamic range within and between music segments
(music dynamic range should be high). &lt;br&gt;
Dialog intelligibility improvements for films and TV, without effecting music and FX elements.
&lt;/p&gt;


&lt;h3&gt;Other Advanced Audio Algorithm Parameters&lt;/h3&gt;

&lt;p&gt;
We also offer advanced audio parameters for our
&lt;a href="https://auphonic.com/help/algorithms/singletrack.html#noise-and-hiss-reduction" target="_blank"&gt;Noise&lt;/a&gt;,
&lt;a href="https://auphonic.com/help/algorithms/singletrack.html#hum-reduction" target="_blank"&gt;Hum Reduction&lt;/a&gt;
and
&lt;a href="https://auphonic.com/help/algorithms/singletrack.html#global-loudness-normalization-and-true-peak-limiter" target="_blank"&gt;Global Loudness Normalization&lt;/a&gt;
algorithms:
&lt;/p&gt;

&lt;a href="https://auphonic.com/media/blog/AdvancedAlgs2.png"&gt;
&lt;img src="https://auphonic.com/media/blog/AdvancedAlgs2.png" width="586px" style="border: 1px solid #ddd;"&gt;
&lt;/a&gt;

&lt;p&gt;
For more details, please see the
&lt;a href="https://auphonic.com/help/web/production.html#advanced-audio-algorithms" target="_blank"&gt;Advanced Audio Algorithms Documentation&lt;/a&gt;.
&lt;/p&gt;


&lt;h3&gt;Want to know more?&lt;/h3&gt;

&lt;p&gt;
If you want to know more details about our advanced algorithm parameters (especially the leveler parameters), please listen to the
following &lt;b&gt;podcast interview&lt;/b&gt; with &lt;em&gt;Chris Curran&lt;/em&gt; (Podcast Engineering School):  &lt;br&gt;
&lt;b&gt;&lt;a href="https://podcastengineeringschool.com/author/fractal41272pes/" target="_blank"&gt;Auphonic’s New Advanced Features, with Georg Holzmann – PES 108&lt;/a&gt;&lt;/b&gt;
&lt;/p&gt;


&lt;h3 id="beta"&gt;Advanced Parameters Private Beta and Feedback&lt;/h3&gt;

&lt;p&gt;
At the moment the advanced algorithm parameters are &lt;b&gt;for beta users only&lt;/b&gt;.
This is to allow us to get user feedback, so we can change the
parameters to suit user needs.
&lt;br&gt;
Please &lt;a href="https://auphonic.com/contact"&gt;let us know&lt;/a&gt; your case studies, if you need
any other algorithm parameters or if you have any questions!
&lt;/p&gt;

&lt;p&gt;
Here are some private beta invitation codes:
&lt;pre&gt;
jbwCVpLYrl 6zmLqq8o3z RXYIUbC6al QDmIZLuPKa JIrnGRZBgl SWQOWeZOBD ISeBCA9gTy w5FdsyhZVI qWAvANQ5mC twOjdHrit3
KwnL2Le6jB 63SE2V54KK G32AULFyaM 3H0CLYAwLU mp1GFNVZHr swzvEBRCVa rLcNJHUNZT CGGbL0O4q1 5o5dUjruJ9 hAggWBpGvj
ykJ57cFQSe 0OHAD2u1Dx RG4wSYTLbf UcsSYI78Md Xedr3NPCgK mI8gd7eDvO 0Au4gpUDJB mYLkvKYz1C ukrKoW5hoy S34sraR0BU
J2tlV0yNwX QwNdnStYD3 Zho9oZR2e9 jHdjgUq420 51zLbV09p4 c0cth0abCf 3iVBKHVKXU BK4kTbDQzt uTBEkMnSPv tg6cJtsMrZ
BdB8gFyhRg wBsLHg90GG EYwxVUZJGp HLQ72b65uH NNd415ktFS JIm2eTkxMX EV2C5RAUXI a3iwbxWjKj X1AT7DCD7V y0AFIrWo5l
&lt;/pre&gt;
We are happy to send further &lt;b&gt;invitation codes&lt;/b&gt; to all interested users - please do not hesitate to
&lt;a href="https://auphonic.com/contact" target="_blank"&gt;contact us&lt;/a&gt;!
&lt;/p&gt;

&lt;p&gt;
If you have an invitation code, you can enter it here to activate the advanced audio algorithm parameters:
&lt;br&gt;
&lt;b style="text-align: center; display: block;"&gt;&lt;a href="https://auphonic.com/private_beta" target="_blank"&gt;Auphonic Algorithm Parameters Private Beta Activation&lt;/a&gt;&lt;/b&gt;
&lt;/p&gt;

&lt;br&gt;
&lt;br&gt;
&lt;br&gt;
</description><author>grh@auphonic.com (Georg)</author><pubDate>Mon, 05 Nov 2018 11:42:22 +0000</pubDate><guid>https://auphonic.com/blog/2018/11/05/auphonic-leveler-customization/</guid><enclosure length="100000" type="image/png" url="http://auphonic.com/media/blog/AdvancedAlgsUpdate.png"/><category>Audio</category><category>Development</category></item><item><title>Resumable File Uploads to Auphonic
</title><link>https://auphonic.com/blog/2018/09/04/resumable-file-uploads/</link><description>
&lt;p&gt;
Large file uploads in a web browser are problematic, even in 2018.
If working with a poor network connection, uploads can fail and
have to be retried from the start.
&lt;/p&gt;

&lt;p style="text-align: center;"&gt;
&lt;img width="400" src="https://auphonic.com/media/blog/resumable_upload.png" /&gt;
&lt;/p&gt;

&lt;p&gt;
At Auphonic, our users have to &lt;b&gt;upload large audio and video files&lt;/b&gt;,
or multiple media files when creating a
&lt;a href="https://auphonic.com/help/web/multitrack.html" target="_blank"&gt;multitrack production&lt;/a&gt;.
To minimize any potential issues, we integrated various
&lt;a href="https://auphonic.com/help/web/services.html" target="_blank"&gt;external services&lt;/a&gt;
which are specialized for large file transfers,
like FTP, SFTP, Dropbox, Google Drive, S3, etc.
&lt;/p&gt;
&lt;p&gt;
To further minimize issues, as of today we have also released &lt;b&gt;resumable and chunked direct file uploads&lt;/b&gt;
in the web browser &lt;b&gt;to auphonic.com&lt;/b&gt;.
&lt;/p&gt;
&lt;p&gt;
If you are not interested in the technical details, please just go to the section
&lt;a href="#auphonic"&gt;Resumable Uploads in Auphonic&lt;/a&gt; below.
&lt;/p&gt;


&lt;h3 id="problem"&gt;The Problem with Large File Uploads in the Browser&lt;/h3&gt;

&lt;p&gt;
If using either mobile networks (which remain fragile) or unstable WiFi connections,
file uploads are often interrupted and will fail.
There are also many areas in the world where connections are quite poor,
which makes uploading big files frustrating.
&lt;/p&gt;

&lt;p&gt;
After an interrupted file upload, the &lt;b&gt;web browser must restart the whole upload from the start&lt;/b&gt;,
which is a problem when it happens in the middle of a 4GB video file upload on a slow connection.
&lt;br&gt;
Furthermore, the longer an upload takes, the more likely it is to have a network
glitch interrupting the upload, which then has to be retried from the start.
&lt;/p&gt;


&lt;h3 id="solution"&gt;The Solution: Chunked, Resumable Uploads&lt;/h3&gt;

&lt;p&gt;
To avoid user frustration, we need to be able to detect network errors and potentially resume an upload
without having to restart it from the beginning.
&lt;/p&gt;

&lt;p&gt;
To achieve this, we have to &lt;b&gt;split a file upload in smaller chunks&lt;/b&gt; directly within the web browser,
so that these chunks can then be sent to the server afterwards. &lt;br&gt;
If an upload fails or the user wants to pause, it is possible
to resume it later and only send those chunks that have not already been uploaded. &lt;br&gt;
If there is a network interruption or change, the upload will be retried automatically.
&lt;/p&gt;

&lt;p&gt;
Companies like Dropbox, Google, Amazon AWS etc. all have their own protocols and API's
for chunked uploads,
but there are also some &lt;b&gt;open source implementations&lt;/b&gt; available, which offer resumable uploads:
&lt;/p&gt;

&lt;dl&gt;
&lt;dt&gt;resumable.js [&lt;a href="http://www.resumablejs.com/" target="_blank"&gt;link&lt;/a&gt;]:&lt;/dt&gt;
&lt;dd&gt;
&lt;em&gt;"A JavaScript library providing multiple simultaneous, stable 
and resumable uploads via the HTML5 File API"&lt;/em&gt; &lt;br&gt;
This solutions is a JavaScript library only and requires that the
protocol is implemented on the server as well.
&lt;/dd&gt;

&lt;dt&gt;tus.io [&lt;a href="https://tus.io/" target="_blank"&gt;link&lt;/a&gt;]:&lt;/dt&gt;
&lt;dd&gt;
&lt;em&gt;"Open Protocol for Resumable File Uploads"&lt;/em&gt; &lt;br&gt;
Tus.io offers a simple, cheap and reusable stack for clients and servers (in many languages).
They have a blog with further information about resumable uploads,
see &lt;a href="https://tus.io/blog.html" target="_blank"&gt;tus blog&lt;/a&gt;.
&lt;/dd&gt;

&lt;dt&gt;plupload [&lt;a href="https://www.plupload.com/" target="_blank"&gt;link&lt;/a&gt;]:&lt;/dt&gt;
&lt;dd&gt;
A JavaScript library, similar to resumable.js, which requires a separate server implementation.
&lt;/dd&gt;
&lt;/dl&gt;

&lt;p&gt;
We chose to use resumable.js and developed our own server implementation.
&lt;/p&gt;


&lt;h3 id="auphonic"&gt;Resumable Uploads in Auphonic&lt;/h3&gt;

&lt;p&gt;
If you upload files to a &lt;a href="https://auphonic.com/help/web/production.html" target="_blank"&gt;singletrack&lt;/a&gt; or &lt;a href="https://auphonic.com/help/web/multitrack.html" target="_blank"&gt;multitrack&lt;/a&gt; production, you will see the upload progress bar
and a &lt;b&gt;pause button&lt;/b&gt;, which is one way to pause and resume an upload:
&lt;/p&gt;

&lt;p style="text-align: center;"&gt;
&lt;img width="580" src="https://auphonic.com/media/blog/auphonic_upload_progress.png" /&gt;
&lt;/p&gt;

&lt;p&gt;
It is also possible to close the browser completely or &lt;b&gt;shut down your computer during the upload&lt;/b&gt;,
then edit the production and upload the file again later.
This will just &lt;b&gt;resume the file upload&lt;/b&gt; from the position where it was stopped before.
&lt;br&gt;
(Previously uploaded chunks are saved for 24h on our servers, after that you have to start
the whole upload again.)
&lt;/p&gt;

&lt;p&gt;
In case of a &lt;b&gt;network problem&lt;/b&gt; or if you switch to a different connection,
we will &lt;b&gt;resume the upload automatically&lt;/b&gt;. &lt;br&gt;
This should solve many problems which were reported by some users in the past!
&lt;/p&gt;

&lt;p&gt;
You can of course also use any of our
&lt;a href="https://auphonic.com/help/web/services.html" target="_blank"&gt;external services&lt;/a&gt;
for stable incoming and outgoing file transfers!
&lt;/p&gt;


&lt;h3&gt;Do you still have Uploading Issues?&lt;/h3&gt;

&lt;p&gt;
We hope that uploads to Auphonic are much more reliable now, even 
on poor connections.
&lt;/p&gt;

&lt;p&gt;
If you still experience any problems,
&lt;a href="https://auphonic.com/contact" target="_blank"&gt;please let us know&lt;/a&gt;. &lt;br&gt;
We are very happy about any bug reports and will do our best to fix them!
&lt;/p&gt;

&lt;br&gt;
&lt;br&gt;
&lt;br&gt;
</description><author>grh@auphonic.com (Georg)</author><pubDate>Tue, 04 Sep 2018 09:39:53 +0000</pubDate><guid>https://auphonic.com/blog/2018/09/04/resumable-file-uploads/</guid><enclosure length="100000" type="image/png" url="http://auphonic.com/media/blog/resumable_upload.png"/><category>Development</category><category>News</category></item><item><title>Codec2: a whole Podcast on a Floppy Disk
</title><link>https://auphonic.com/blog/2018/06/01/codec2-podcast-on-floppy-disk/</link><description>
&lt;p&gt;
In a &lt;a href="https://auphonic.com/blog/2012/09/26/opus-revolutionary-open-audio-codec-podcasts-and-internet-audio/" target="_blank"&gt;previous blogpost&lt;/a&gt; we talked about the Opus codec, which offers very low bitrates. Another codec seeking to achieve even lower bitrates is &lt;a href="http://www.rowetel.com/?page_id=452" target="_blank"&gt;Codec 2&lt;/a&gt;.
&lt;/p&gt;
&lt;p&gt;
Codec 2 is designed for use with speech only, and although the bitrates are impressive the results aren’t as clear as Opus, as you can hear in the following &lt;a href="#codec2-examples"&gt;audio examples&lt;/a&gt;.
However, there is some interesting work being done with Codec 2 in combination with neural network (&lt;a href="https://deepmind.com/blog/wavenet-generative-model-raw-audio/" target="_blank"&gt;WaveNets&lt;/a&gt;) that is yielding &lt;a href="#wavenet-examples"&gt;great results&lt;/a&gt;.
&lt;/p&gt;

&lt;p style="text-align: center; font-size: small;"&gt;
&lt;img width="472" src="https://auphonic.com/media/blog/wavenet_layers.png" /&gt;
Layers of a &lt;a href="https://deepmind.com/blog/wavenet-generative-model-raw-audio/" target="_blank"&gt;WaveNet&lt;/a&gt; neural network.
&lt;/p&gt;


&lt;h3&gt;Background&lt;/h3&gt;

&lt;p&gt;
&lt;a href="http://www.rowetel.com/?page_id=452" target="_blank"&gt;Codec 2&lt;/a&gt; is an open source codec designed for speech, and aims for compression rates between 700bps and 3200bps (bits per seconds).
&lt;/p&gt;
&lt;p&gt;
The man behind it, &lt;a href="http://www.rowetel.com/?page_id=434" target="_blank"&gt;David Rowe&lt;/a&gt;, is an electronic engineer currently living in South Australia. He started the project in September 2009, with the main aim of improving low-cost radio communication for people living in remote areas of the world. With this in mind, he set out to develop a codec that would significantly reduce file sizes and the bandwidth required when streaming.
&lt;/p&gt;
&lt;p&gt;
Another motivation according to David, was to be &lt;b&gt;free from patented technologies&lt;/b&gt; used by closed source codes which he believes “require expensive and awkward licenses and are stifling innovation”. His belief is that this work can be done without requiring the use of patent protected codecs, so all his work is open source.
&lt;/p&gt;


&lt;h3&gt;Potential Applications&lt;/h3&gt;

&lt;p&gt;
Rowe’s perceived applications include VOIP trunking, voice over low bandwidth HF/VHF digital radio, (especially for amateur radio, so as to avoid issues with the use of proprietary codecs), and developing world and remote area communications, including military, police and emergency services.
&lt;/p&gt;
&lt;p&gt;
Why we’re interested here at Auphonic is for its potential for longer podcasts, presentations and audiobooks, allowing for &lt;b&gt;low storage&lt;/b&gt; and minimizing the effect of &lt;b&gt;bad network connections&lt;/b&gt;.
&lt;/p&gt;


&lt;h3&gt;How it Works&lt;/h3&gt;

&lt;p&gt;To achieve the lower rates sought, speech has to be reduced into the smallest possible information/data, and this means that the amount of redundant information that is transmitted has to be minimized.&lt;/p&gt;

&lt;p&gt;To do this, Codec 2 uses &lt;b&gt;harmonic sinusoidal speech coding&lt;/b&gt;. This splits the speech into 10 - 30ms segments, called frames. Each frame is then analysed for the fundamental frequency (or pitch), and the number of harmonics that fit into a 4Khz bandwidth. Further, for each of the harmonics within the 4khz range, the amplitude and phase are recorded. &lt;/p&gt;

&lt;p&gt;This information is then coded, and the decoder reconstructs the audio based on this data.&lt;/p&gt;

&lt;p style="text-align: center; font-size: small;"&gt;
&lt;a href="https://auphonic.com/media/blog/Codec2_Enc_Dec_Diagram.png" target="_blank"&gt;
&lt;img src="https://auphonic.com/media/blog/Codec2_Enc_Dec_Diagram.png" alt="Codec 2 encoding and decoding process" width="600" /&gt;
&lt;/a&gt;
Codec 2 Block diagrams - Encoder (left) And decoder (right)&lt;br/&gt;
Figure from &lt;a href="http://www.rowetel.com/?page_id=452#refs"&gt;Rowtel&lt;/a&gt;.
&lt;/p&gt;


&lt;h3 id="codec2-examples"&gt;Audio Examples and Comparison with other Codecs&lt;/h3&gt;
			  
&lt;p&gt;Whilst it all sounds great in theory, how does the reality match up? Let’s have a listen.

&lt;/p&gt;
&lt;p&gt;Here is a short wav audio file:&lt;/p&gt;  
	&lt;table&gt;
&lt;tr&gt;
    &lt;td style="text-align: center; font-size: small;"&gt;  
	  intro-orig.wav - 1.3 MB (&lt;a href="https://auphonic.com/media/audio-examples/codec2/intro-orig.wav"&gt;download&lt;/a&gt;): &lt;br/&gt;
&lt;audio controls="controls" preload="none" style="width: 50%;"&gt;
  &lt;source src="https://auphonic.com/media/audio-examples/intro-opus_024kbps.wav" /&gt;
&lt;/audio&gt;
	&lt;/td&gt;
		&lt;/tr&gt;
	  &lt;/table&gt;

&lt;p&gt;Applying Codec 2 (without the WaveNet decoder) at the different rates available, 3200bps, 2400bps,1600bps,1200bps and 700bps, we get:&lt;/p&gt;
	  
&lt;table&gt;
&lt;tr&gt;
    &lt;td style="text-align: center; font-size: small;"&gt;
3200bps (&lt;a href="https://auphonic.com/media/audio-examples/codec2/intro-orig_3200bps.c2.wav"&gt;download&lt;/a&gt;): &lt;br/&gt;
&lt;audio controls="controls" preload="none" style="width: 100%;"&gt;
  &lt;source src="https://auphonic.com/media/audio-examples/codec2/intro-orig_3200bps.c2.wav" /&gt;
&lt;/audio&gt;
    &lt;/td&gt;
    &lt;td style="text-align: center; font-size: small;"&gt;
2400bps (&lt;a href="https://auphonic.com/media/audio-examples/codec2/intro-orig_2400bps.c2.wav"&gt;download&lt;/a&gt;): &lt;br/&gt;
&lt;audio controls="controls" preload="none" style="width: 100%;"&gt;
  &lt;source src="https://auphonic.com/media/audio-examples/codec2/intro-orig_2400bps.c2.wav" /&gt;
&lt;/audio&gt;
    &lt;/td&gt;
&lt;/tr&gt;
&lt;/table&gt;
	  
	  &lt;table&gt;
&lt;tr&gt;
    &lt;td style="text-align: center; font-size: small;"&gt;
1600bps (&lt;a href="https://auphonic.com/media/audio-examples/codec2/intro-orig_1600bps.c2.wav"&gt;download&lt;/a&gt;): &lt;br/&gt;
&lt;audio controls="controls" preload="none" style="width: 100%;"&gt;
  &lt;source src="https://auphonic.com/media/audio-examples/codec2/intro-orig_1600bps.c2.wav" /&gt;
&lt;/audio&gt;
    &lt;/td&gt;
    &lt;td style="text-align: center; font-size: small;"&gt;
1200bps (&lt;a href="https://auphonic.com/media/audio-examples/codec2/intro-orig_1200bps.c2.wav"&gt;download&lt;/a&gt;): &lt;br/&gt;
&lt;audio controls="controls" preload="none" style="width: 100%;"&gt;
  &lt;source src="https://auphonic.com/media/audio-examples/codec2/intro-orig_1200bps.c2.wav" /&gt;
&lt;/audio&gt;
    &lt;/td&gt;
&lt;/tr&gt;
&lt;/table&gt;
	  	  &lt;table&gt;
&lt;tr&gt;
    &lt;td style="text-align: center; font-size: small;"&gt;
700bps (&lt;a href="https://auphonic.com/media/audio-examples/codec2/intro-orig_700bps.c2.wav"&gt;download&lt;/a&gt;): &lt;br/&gt;
&lt;audio controls="controls" preload="none" style="width: 50%;"&gt;
  &lt;source src="https://auphonic.com/media/audio-examples/codec2/intro-orig_700bps.c2.wav" /&gt;
&lt;/audio&gt;
    &lt;/td&gt;
	&lt;/tr&gt;
&lt;/table&gt;

&lt;p&gt;
These examples show significantly reduced file sizes. &lt;br&gt;
Putting that information more meaningfully in terms of &lt;b&gt;how much storage you would need for an hour of audio:&lt;/b&gt;
&lt;/p&gt;

&lt;ul&gt;	
	&lt;li&gt;At 3200bps, &lt;b&gt;1 hour&lt;/b&gt; of audio requires only &lt;b&gt;1.37MB&lt;/b&gt; (this would fit on one old 3½-inch floppy disk!)&lt;/li&gt;		
	&lt;li&gt;A rate of 2400bps equates to 1.03MB/h&lt;/li&gt;
	&lt;li&gt;A rate of 1600bps equates to 0.68MB/h (Or approximately &lt;b&gt;2 hours of audio on one floppy disk&lt;/b&gt;!)&lt;/li&gt;
	&lt;li&gt;A rate of 1200bps equates to 0.51MB/h&lt;/li&gt;
	&lt;li&gt;A rate of 700bps equates to 0.3MB/h&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;
So great compression, but the result is clearly not natural sounding.
&lt;/p&gt;
&lt;p&gt;As a comparison here is the same audio as a 8kb/s MP3:&lt;/p&gt;
	  
	  &lt;table&gt;
&lt;tr&gt;
    &lt;td style="text-align: center; font-size: small;"&gt;
MP3 at 8 kb/s -  23kb file size (&lt;a href="https://auphonic.com/media/audio-examples/codec2/intro-orig_8k.mp3"&gt;download&lt;/a&gt;): &lt;br/&gt;
&lt;audio controls="controls" preload="none" style="width: 50%;"&gt;
  &lt;source src="https://auphonic.com/media/audio-examples/codec2/intro-orig_8k.mp3" /&gt;
&lt;/audio&gt;
	&lt;/td&gt;&lt;/tr&gt;
			  &lt;/table&gt;

&lt;p&gt;The file size is significantly larger than Codec 2 and the quality is arguably still not useable. You can clearly hear what is sometimes called sizzle - the weird metallic sounds you hear on low quality MP3s.&lt;/p&gt;
	
&lt;p&gt;There is a final codec which is worth comparing, one that that seems to capture the two ideals of usable quality at low bitrates that we want: &lt;a href="https://auphonic.com/blog/2012/09/26/opus-revolutionary-open-audio-codec-podcasts-and-internet-audio/" target="_blank"&gt;Opus&lt;/a&gt;.
&lt;br&gt;
Because of it's convincing low-bitrate performance, Auphonic already offers Opus encoding all the way down to 6 kbps, the lowest bitrate that Opus supports.
&lt;/p&gt;
&lt;p&gt;Comparing Opus at this 6 kbps rate to the 8kbps MP3 shows a significant improvement -  although slightly muffled, it still sounds natural:
&lt;/p&gt;
			  
			  &lt;table&gt;
    &lt;td style="text-align: center; font-size: small;"&gt;
Opus at 6kbps (&lt;a href="https://auphonic.com/media/audio-examples/intro-opus_006kbps.wav"&gt;download&lt;/a&gt;):&lt;br&gt;
&lt;audio controls="controls" preload="none" style="width: 50%;"&gt;
  &lt;source src="https://auphonic.com/media/audio-examples/intro-opus_006kbps.wav" /&gt;
&lt;/audio&gt;
    &lt;/td&gt;
			  &lt;/table&gt;

&lt;p&gt;
Returning to Codec 2, and purely as s a bit of fun, here are some &lt;b&gt;samples of Codec 2 on music&lt;/b&gt;!  (Note that Codec 2 is not designed for music, it was only ever conceived for use on speech).
&lt;/p&gt;
	  
	  	  &lt;table&gt;
&lt;tr&gt;
    &lt;td style="text-align: center; font-size: small;"&gt;
Original file (&lt;a href="https://auphonic.com/media/audio-examples/codec2/music-test.wav"&gt;download&lt;/a&gt;): &lt;br/&gt;
&lt;audio controls="controls" preload="none" style="width: 100%;"&gt;
  &lt;source src="https://auphonic.com/media/audio-examples/codec2/music-test.wav" /&gt;
&lt;/audio&gt;
    &lt;/td&gt;
    &lt;td style="text-align: center; font-size: small;"&gt;
As a 8kbps MP3 (&lt;a href="https://auphonic.com/media/audio-examples/codec2/music-test_8k.mp3"&gt;download&lt;/a&gt;): &lt;br/&gt;
&lt;audio controls="controls" preload="none" style="width: 100%;"&gt;
  &lt;source src="https://auphonic.com/media/audio-examples/codec2/music-test_8k.mp3" /&gt;
&lt;/audio&gt;
    &lt;/td&gt;
&lt;/tr&gt;
&lt;/table&gt;
&lt;p&gt;I personally couldn’t listen to the MP3 at this rate, so let’s listen to what Codec 2 does!
&lt;/p&gt;

&lt;p&gt;Codec 2 at different bitrates:&lt;/p&gt;
&lt;table&gt;
&lt;tr&gt;
    &lt;td style="text-align: center; font-size: small;"&gt;
3200bps (&lt;a href="https://auphonic.com/media/audio-examples/codec2/music-test_3200bps.c2.wav"&gt;download&lt;/a&gt;): &lt;br/&gt;
&lt;audio controls="controls" preload="none" style="width: 100%;"&gt;
  &lt;source src="https://auphonic.com/media/audio-examples/codec2/music-test_3200bps.c2.wav" /&gt;
&lt;/audio&gt;
    &lt;/td&gt;
    &lt;td style="text-align: center; font-size: small;"&gt;
2400bps (&lt;a href="https://auphonic.com/media/audio-examples/codec2/music-test_2400bps.c2.wav"&gt;download&lt;/a&gt;): &lt;br/&gt;
&lt;audio controls="controls" preload="none" style="width: 100%;"&gt;
  &lt;source src="https://auphonic.com/media/audio-examples/codec2/music-test_2400bps.c2.wav" /&gt;
&lt;/audio&gt;
    &lt;/td&gt;
&lt;/tr&gt;
&lt;/table&gt;
	  
	  &lt;table&gt;
&lt;tr&gt;
    &lt;td style="text-align: center; font-size: small;"&gt;
1600bps (&lt;a href="https://auphonic.com/media/audio-examples/codec2/music-test_1600bps.c2.wav"&gt;download&lt;/a&gt;): &lt;br/&gt;
&lt;audio controls="controls" preload="none" style="width: 100%;"&gt;
  &lt;source src="https://auphonic.com/media/audio-examples/codec2/music-test_1600bps.c2.wav" /&gt;
&lt;/audio&gt;
    &lt;/td&gt;
    &lt;td style="text-align: center; font-size: small;"&gt;
1200bps (&lt;a href="https://auphonic.com/media/audio-examples/codec2/music-test_1200bps.c2.wav"&gt;download&lt;/a&gt;): &lt;br/&gt;
&lt;audio controls="controls" preload="none" style="width: 100%;"&gt;
  &lt;source src="https://auphonic.com/media/audio-examples/codec2/music-test_1200bps.c2.wav" /&gt;
&lt;/audio&gt;
    &lt;/td&gt;
&lt;/tr&gt;
&lt;/table&gt;
	  
	  	  &lt;table&gt;
&lt;tr&gt;
    &lt;td style="text-align: center; font-size: small;"&gt;
700bps (&lt;a href="https://auphonic.com/media/audio-examples/codec2/music-test_700bps.c2.wav"&gt;download&lt;/a&gt;): &lt;br/&gt;
&lt;audio controls="controls" preload="none" style="width: 50%;"&gt;
  &lt;source src="https://auphonic.com/media/audio-examples/codec2/music-test_1600bps.c2.wav" /&gt;
&lt;/audio&gt;
    &lt;/td&gt;
&lt;/tr&gt;
&lt;/table&gt;

&lt;p&gt;As you can hear, it is not suitable for this application at all!&lt;/p&gt;


&lt;h3 id="wavenet-examples"&gt;Codec 2 and WaveNet&lt;/h3&gt;

&lt;p&gt;As we have heard, despite the impressive bitrates achieved, the end result is not very natural sounding. &lt;br&gt;
However, where it starts to get more interesting is the work done by  W. Bastiaan Kleijn from Cornell University Library. He has been using with Codec 2 running at 2400bps on the coding side, but &lt;a href="http://www.rowetel.com/?p=5966"&gt;replaced the Codec 2 decoder with a WaveNet deep learning generative model&lt;/a&gt;
(for more informationsee the paper
&lt;a href="https://arxiv.org/pdf/1712.01120.pdf" target="_blank"&gt;Wavenet based low rate speech coding&lt;/a&gt;).
&lt;/p&gt;

&lt;p&gt;
Here are some samples from &lt;a href="https://storage.googleapis.com/downloads.webmproject.org/icassp2018/index.html" target="_blank"&gt;the authors&lt;/a&gt;:
&lt;/p&gt;

&lt;table&gt;
  &lt;tr&gt;
    &lt;th&gt;&lt;b&gt;Codec&lt;/b&gt;&lt;/th&gt;
    &lt;th&gt;&lt;b&gt;Male Example&lt;/b&gt;&lt;/th&gt;
  &lt;/tr&gt;
  &lt;tr&gt;
    &lt;td&gt;Original File&lt;/td&gt;
    &lt;td&gt;&lt;audio preload="none" controlsList="nodownload" controls&gt;&lt;source src="https://storage.googleapis.com/downloads.webmproject.org/icassp2018/446-o.wav" type="audio/wav"&gt;&lt;/audio&gt;&lt;/td&gt;
  &lt;/tr&gt;
  &lt;tr&gt;
    &lt;td&gt;Codec 2&lt;/td&gt;
    &lt;td&gt;&lt;audio preload="none" controlsList="nodownload" controls&gt;&lt;source src="https://storage.googleapis.com/downloads.webmproject.org/icassp2018/446-c.wav" type="audio/wav"&gt;&lt;/audio&gt;&lt;/td&gt;
  &lt;/tr&gt;
  &lt;tr&gt;
    &lt;td&gt;With WaveNet Decoder&lt;/td&gt;
    &lt;td&gt;&lt;audio preload="none" controlsList="nodownload" controls&gt;&lt;source src="https://storage.googleapis.com/downloads.webmproject.org/icassp2018/446-w.wav" type="audio/wav"&gt;&lt;/audio&gt;&lt;/td&gt;
  &lt;/tr&gt;
&lt;/table&gt;

&lt;table&gt;
  &lt;tr&gt;
    &lt;th&gt;&lt;b&gt;Codec&lt;/b&gt;&lt;/th&gt;
    &lt;th&gt;&lt;b&gt;Female Example&lt;/b&gt;&lt;/th&gt;
  &lt;/tr&gt;
  &lt;tr&gt;
    &lt;td&gt;Original File&lt;/td&gt;
    &lt;td&gt;&lt;audio preload="none" controlsList="nodownload" controls&gt;&lt;source src="https://storage.googleapis.com/downloads.webmproject.org/icassp2018/442-o.wav" type="audio/wav"&gt;&lt;/audio&gt;&lt;/td&gt;
  &lt;/tr&gt;
  &lt;tr&gt;
    &lt;td&gt;Codec 2&lt;/td&gt;
    &lt;td&gt;&lt;audio preload="none" controlsList="nodownload" controls&gt;&lt;source src="https://storage.googleapis.com/downloads.webmproject.org/icassp2018/442-c.wav" type="audio/wav"&gt;&lt;/audio&gt;&lt;/td&gt;
  &lt;/tr&gt;
  &lt;tr&gt;
    &lt;td&gt;With WaveNet Decoder&lt;/td&gt;
    &lt;td&gt;&lt;audio preload="none" controlsList="nodownload" controls&gt;&lt;source src="https://storage.googleapis.com/downloads.webmproject.org/icassp2018/442-w.wav" type="audio/wav"&gt;&lt;/audio&gt;&lt;/td&gt;
  &lt;/tr&gt;
&lt;/table&gt;

&lt;p&gt;
Comparing to Codec 2 you can hear a &lt;b&gt;significant increase in quality&lt;/b&gt;, and if you compare to the original, there is not a significant decrease in quality.&lt;/p&gt;
David Rowe himself has stated that he considers the result to be &lt;b&gt;"a game changer for low bit rate speech coding"&lt;/b&gt; and “as good an an 8000bps wideband speech codec”.
&lt;/p&gt;


&lt;h3&gt;Conclusion&lt;/h3&gt;

&lt;p&gt;Whilst the (original) Codec 2 project represents very interesting work, it is limited, and the end result is not suited for podcasting. Also as we heard in the audio examples, it can only be used for voice recordings, and not music. 
&lt;/p&gt;
&lt;p&gt;
However, Codec 2 in combination with a &lt;b&gt;WaveNet decoder&lt;/b&gt; improves the quality a lot and the
low bitrate (2400bps) would be extremely interesting for &lt;b&gt;podcasts and audiobooks distribution&lt;/b&gt; as well:
&lt;b&gt;one hour of audio&lt;/b&gt; would require only &lt;b&gt;1.03MB&lt;/b&gt; of storage!
&lt;/p&gt;
&lt;p&gt;Auphonic will add support for Codec 2 &lt;a href="https://auphonic.com/help/web/production.html#output-files" target="_blank"&gt;output files&lt;/a&gt; when the WaveNet decoder is in a usable form. For now we have just added &lt;b&gt;support for Codec 2 input files&lt;/b&gt;.
&lt;/p&gt;

&lt;br&gt;
&lt;br&gt;
&lt;br&gt;
</description><author>stewart.cuthew@gmail.com (Stewart)</author><pubDate>Fri, 01 Jun 2018 09:28:53 +0000</pubDate><guid>https://auphonic.com/blog/2018/06/01/codec2-podcast-on-floppy-disk/</guid><enclosure length="100000" type="image/png" url="http://auphonic.com/media/blog/wavenet_layers.png"/><category>Audio</category><category>Development</category></item></channel></rss>