Making Happy Audio

levelator only window interfaceBack in the very very early days of podcasting, there was The Conversations Network, a non-profit headed up by Doug Kay. Doug’s idea was to create a series of podcasts by a global team of volunteers. Imagine the excitement of that age. This is around 2004/2005 when we were just realizing that we were in the wild wild west of broadcasting. We were talking about how the FCC couldn’t tell us what to do, how you could release shows any time you wanted to, no schedules needed. It was a crazy time.

But Doug realized that he had a problem with all these volunteers. They were doing interviews but the audio levels of their voice and the voices of their interviewees was all over the map. Doug dreamed up the idea of a tool that would make the levels the same across all interviews, and got developers, Bruce and Malcom Sharpe along with Norman Lorrain and Russell Heistuman to turn that vision a reality. They created a very simple tool called The Levelator. Volunteer reporters for the network were required to drop their uncompressed recording files onto The Levelator and it would smooth out the audio making it listenable.

The best thing that ever happened to podcasting was when Doug made The Levelator available for free for all podcasters. It became an integral part of every podcaster’s toolkit. Even if you thought things were fine, you’d still run it through The Levelator before publishing. You may have heard me refer to it from time to time, especially if you’ve ever been to the live show (Sunday nights at 5pm Pacific Time at podfeet.com/live. While creating the show, I’ll drop in a listener recording and I’ll play around with the volume a bit but as long as it’s pretty close, you’ll hear me say “The Levelator will fix it!”

Over the years, whenever I would listen to a podcast where I was struggling to hear one voice and then covering my ears because the next voice was too loud, I would write to the creators and beg them to use the Levelator. I remember working with the SMR Podcast guys, the Pocket Show’s Paul Wheatley (I miss that show!) and even the Wired.co.uk podcast’s Nate Lanxon, each of them getting a “talk” from me about The Levelator.

You’d think there’s no problem to be solved here but you’d be wrong. The problem is that Doug and his folks folded up shop in 2010 and abandoned The Levelator. The good news is The Levelator kept working. When it was last released, it was guaranteed to work on everything from Tiger to Lion (10.7). If you’ve lost track of the numbers, El Capitan was released this week as 10.11. We got lucky for 10.8, 10.9 and 10.10 but our luck started to run out on 10.11. Reports flew around the Internet this week that The Levelator didn’t work any more under El Capitan.

But then, a lifeline. chas_m posted in mac-forums.com that a “clever fellow noted that it might be a case of a missing file in /usr/local/lib in El Capitan”. I don’t know who the “clever fellow” was, but chas_m compared the Yosemite directory and the El Capitan directory, and took a copy of the libsndfile.1.dylib file out of the Levelator package and placed it in /usr/local/lib. He restarted his Mac to allow the file to load, and it fixed the issue. Many people have used this fix, but it isn’t exactly a user-friendly solution, right?

I told you this entire back history because I’ve abandoned The Levelator (hey, he left me first). With some help from my friends, I’m in a better place now. But again, some more back story!

A few months back Victor Cajiao of the TerraTech podcast started talking to me about another problem, and that’s the problem of the loudness variation between podcasts. You know the problem, you’re doodling along on your bicycle with an awesome playlist of podcasts and suddenly the next one is way too quiet. You crank up the volume but the next show blows your eardrums out. Paul Figgiani from producenewmedia.com, a good friend of Victor’s is leading the charge to fix this problem.

I’m not a specialist in this field but from what I have learned from Paul and Victor, loudness standards were set for television a while back and that made our lives more pleasant. Not universally obeyed but way better than it used to be. Paul wants to bring this to podcasting and I’m all for it as a consumer of the medium. However, when Paul talked about it for podcasters, it was WAY WAY WAY too hard to understand how to comply with this mythical loudness standard of which he speaks. I’m pretty geeky but it was beyond me how to comply. I told Victor I was on board, as soon as there was a button I could push that would make my podcast comply with the standards.

euphonic logoAbout a year passed, and just recently Victor told me that Paul told HIM about a tool that would finally give me that one button push to comply with the loudness standards. Better yet, it would also level the audio much like what The Levelator has been doing all these years. The app is called Auphonic Leveler from auphonic.com. Georg, the developer, has an interesting business model for Auphonic. You can use it as an online service where you upload your files to their server and run the tool against it and get delivered a processed file, or you can buy the tool for your desktop.

auphonic_main_windowThe online tool is free if you only need two hours a month of audio processed, but they have pretty reasonably priced options for more time, like $11/month for 9 hours of audio processing or you can buy credits in packs and use when you need them. The credits are probably a good deal because you can use the 2 hours free first and your credits after that.

I don’t like monthly payments though, so I bought the standalone tool. You’re going to cough at the price – it’s $89 for personal and non-commercial use. I do make a very small profit off the NosillaCast (especially now that I can’t throw a party for the NosillaCastaways every year at Macworld) so I wrote to the developer, Georg, who said that it was fine for me to buy the personal license. They have a commercial license that’s $349 for up to five users.

auphonic_algorithmAt first I had buyer’s remorse about the $89, but as I worked through how to use the tool with Victor I realized how much Auphonic actually does. As mentioned it does the loudness compliance and leveling, but it will also convert the file to an mp3 so I don’t have to go through iTunes to do that. It will also processor audio to remove noise and hum. I have pretty clean audio but once in a while there’s a problem with a recording from someone else. Last week, George from Tulsa’s recording had a rumble underneath his audio that wasn’t bad but when I heard it during the live show as I assembled the recordings in GarageBand, I thought I’d give Auphonic a test real time. I dropped his original wav file into Auphonic, clicked the box to denoise/dehum and left it set to “auto” and in just a few seconds I had a beautifully clean recording. I’ve tried to use denoise filters before and the recording ALWAYS sound weird afterwards, but not with Auphonic, it was perfect!

Steve has been helping me try to perfect the settings in Auphonic, starting with what Victor taught me. Steve noticed two weeks ago that my audio sounded harsher than it used to and we think it’s because I have a compressor on my voice in Audio Hijack as it’s being recorded, but Auphonic is applying a peak limiter after the fact as well. Both of these tools help keep the maximum peak level of my voice from going too high and we think they were fighting with each other. I’m not sure we’ve got it perfect just yet but we’re working on it.

I wanted to tell you all about this not because I expect you to go out and become a podcaster (it would be awesome if you did) but rather because I think it’s an important lesson to think about the tools you love and use and have a plan for what you’ll do if they go away. I ripped the bandaid off quickly when Apple abandoned Aperture and I was rather pleased with myself that I’d converted to Auphonic before The Levelator died.

My next project is to figure out how to replace GarageBand 6.0.5 in my workflow. I really do need to get off of it since it’s been abandoned for more than a year now with the new and totally useless for podcasting version! And finally, the best part, this massive change in my workflow means that Dorothy has something new to script for me!

6 thoughts on “Making Happy Audio

  1. Joe C. Hecht - October 5, 2015

    In the year 2015, we can denoise photos and video to perfection (using open source solutions), and audio has been left in the dirt?

    So so sad… it’s not rocket science. I wonder why there is not more “leveling” code out there (and seeing the $$$, it might not be long before there is).

    But I do have a question about the product (or is it a service)… How do they handle potential licensing issues with formats such as AAC? Does a server based encoder software that profits as a “service” qualify as an encoder product? I’m no lawyer, but I have a real hard time with licensing when I want to use certain encoders (those licenses can cost a mint for commercial use).

    Seems to me if you use the “service?” then you are really just running an encoder “product” (hardware/software) remotely.

    I know a lot of license agreements forbid that sort of thing (a service).

    For example, I cannot set up a compiling service for “Delphi Code” that optimized the output.

    I wonder how that works in the ‘ol licensing department. There is a lot reasons why FFMpeg is distributed the way that it is, and the answer to that one sure fits in the same bucket.

    Joe

  2. Joe C. Hecht - October 18, 2015

    Well, we have been considering this for awhile, and due to your post, we sprung for a commercial license for both desktop versions (leveler and multitrack). It is pretty good, but does require small tweaks. The Mac version crashes a bit more than the windows version.

    Someone was talking about the format of the mp3s produced. I would highly reccommend using a lossless format for both input and output, then encode to your liking for final distribution. That way, you maintain total control, and can also losslessly edit the output (it ain’t always perfect and may even add add a hiccup or two.

    As far as the old levelstor, its still a good tool, runs under Windows (you really should have a VM sitting around for these sort of things*), and it sounds like a fix is just but a small shell script away!

    But at the end of the day, we are all happy, the audio from one source has improved a lot, and Auphonic made the payroll from the sale at your recommendation.

    * It really does not matter what your personal bias is for an operating system, but if you work in computers, you probably should have the big three installed (Mac, windows, and Linux) for flexability and hands on experience. There is nothing wrong with having a couple of VM’s and using the best tools for the job, no matter what OS it uses.

  3. podfeet - October 18, 2015

    Glad you like Auphonic Joe. I have been using it for weeks now and never had a single crash, so maybe there’s some software you’re running that I’m not that’s interfering.I’ve run it under Yosemite and El Capitan without issue.

    Your use of the word “should” is problematic to me. Perhaps you meant “could benefit from” or “would find value if” I installed a Windows VM?

    I have run Windows and Linux VMs on my Macs but to date I haven’t found something I couldn’t do on my Mac. I love experimenting but not useful so far. I’m not saying you don’t find value but so far I haven’t found anything super useful.

  4. Joe C. Hecht - October 18, 2015

    I’m not sure of the context of “should” (there where two) that was problematic, and one was “probably should”.

    I will admit, I “probably should” be better with my choice of words 🙂

    I know everyone has an OS bias, but it’s just an OS, and it only there to run the programs you actually use to get work done. That said, there are a huge numbers of programs that are either only available on one OS, or that work better or have features that are only available on a given OS. I’m not picky about the OS, I just want to get the best results in the shortest amount of time.

    I have tons of Window software that just runs circles around what is available
    on the Mac, or can do things that I cannot do on the Mac. And visa-versa. And Linux too.

    But I only know that from the experience gained from not narrowing my choices to one OS.

    When I obtain software, I ask not “what is available for my Mac” but “what is available to get the job done, what are it’s features, and what is the cost”. I have all three OS’s running on my Macs, so what does the OS really matter? It’s just provides a window afterall.

    For example, I have the OSX Server, and it’s junk. So is Microsoft IIS. Unless I have something IIS specific, serving is a job for Linux. Video editing? Windows (hands down). Audio editing (Mac, hands down). Skype under Linux is not that great. Window’s is OK, and it is outstanding on the Mac. VMWare on Linux is OK, good under Windows, and outstanding under Fusion on the Mac. You would be very hard pressed to see me using WireCast on any platform. My Video mixing is done in Windows (but the video source often comes from the Mac where it just look a lot better). I could go on and on.

    Of course, I’m a coder by trade, and develop code for all the platforms. I see a lot, but what I really notice is from the social network posting that clearly show that the world is expanding onto all three platforms. Linux used to be a word only a few knew about, Mac users where a small minority, and PC users clearly ruled.

    Today, even young kids know this stuff… Chrome OS too.

    So sure, if you work in computers, you should probably have the big three OS’s installed, and be reasonably well versed in all of them, if for nothing else but to stay caught up.

    And if you really do that, in time, you will start to find things you can do better with software on one OS than another.

    That’s just my 2 cents. I don’t really have a bias for an OS (probably because I really don’t like any of them). I just want to get the job done.

  5. Joe C. Hecht - October 18, 2015

    Allison Said: “I have been using it for weeks now and never had a single crash, so maybe there’s some software you’re running that I’m not that’s interfering. I’ve run it under Yosemite and El Capitan without issue.”

    That’s sounds like an excuse from the Windows 3.0 days (where that was a common
    occurrence).

    I am not sure how that would work on a modern 64 bit operating system shored up with a good deal of sandboxing (and based on Unix at that).

    Let’s put the blame squarely where it belongs, instead of blaming Apple for allowing a Windows problem from back in the year 1992.

    The crash dialog is the operating system saying “The program is trying to access the memory outside of what is allowed and I won’t allow that to happen”.

    Good! The OS did it’s job, and isolated one program from stepping on something else (ie: keeping programs from interfering with one another).

    OK, I fed it a wave file that it croaked on halfway though the processing. And it is repeatable on more than one mac. A coding mistake is clearly the issue.

    The program mainly relies on free 3rd party library code (from the Linux world) to get the job done, and it all too easy for that code to cause a crash by whats called a wild pointer (the program accidently trying to access memory outside of what it is allowed to).

    Simply put, somewhere, the program is trying to read or write to the address of a memory location it does not own (ie: bad coding).

    That’s what the OS told us. But should it have happened that way? No. The folks who coded the app left out: “Try {ToDoSomething} Catch {OopsItBlewUp – TellTheUser}”.

    Basically, they have a easy way to catch a problem, but (because someone was lazy), they left it out.

    The app should have announced “hey, there was a problem processing the file – could you send it to us?”, then cleanly closed down.

    There ya go. And with a little effort, and I can even find the place that it blows (some info is announced in the dialog), and tell why it blows by doing a debug trace of the code.

    I do know a bit about linking up with some of that 3rd party code (I do that for a living), and as I originally pointed out, I do wonder how the licensing works out for one or more of those formats (originally I wondered about using it in a commercial service). For us, it’s a real pain to interface with some of that code in a close source commercial desktop app (not a service) and still keep the legal department smiling.

    Joe

  6. […] in 2015, I wrote an article entitled Making Happy Audio in which I walked through the different technologies that I was using to create listenable […]

Leave a Reply

Your email address will not be published. Required fields are marked *

Scroll to top