AI-powered voice denoiser, optimized for Apple Silicon

Post by iansampson »

Wonks wrote: Sat Jan 21, 2023 12:09 pm I was thinking more along the lines of a built-in processing-tool that comes with a DAW, so it wouldn't need to be real time and latency wouldn't be an issue.

Ah yes, a built-in offline processor for Logic would definitely come in handy. I tend to keep all my edited/assembled takes as separate regions so I can always go back and change things later — it’d be very convenient to process those all together without having to export anything. As you say, though, there’s no way to add extensions (besides real-time plugins) without going through Apple, and I have no idea how you’d even approach them about that. So for now, yes, a standalone app may well be the best way to go.
iansampson
Poster
Posts: 17 Joined: Tue Jan 17, 2023 8:47 pm

Post by iansampson »

James Perrett wrote: Sat Jan 21, 2023 12:46 pm I keep all my processed files as FLAC in order to reduce the space needed although, as you are Apple based, I guess ALAC would be better for you.

Ah, that’s good to know. The app can import FLAC and ALAC already, but only exports .wav and .aif — I’ll work on adding support for losslessly compressed formats to a future release.

James Perrett wrote: Sat Jan 21, 2023 12:46 pm I’ve tried a couple of Spleeter based VST plug-ins but they really haven't been very practical.

Out of curiosity, what was it about the Spleeter-based plugins that made them impractical? Was the audio quality not good enough, or is a plugin just not the right tool for that sort of thing?

James Perrett wrote: Sat Jan 21, 2023 12:46 pm For this sort of work I don't think you would lose much by keeping it as a simple standalone app.

So far I’ve kept things as simple as can be: a minimal drag-and-drop interface, where you drop audio files onto the app and it saves denoised versions to a specified folder. More like a batch processor, I suppose, since you can do several at a time. Ultimately, I’d like to move towards a document-based approach, where you open an audio file and see the waveform and/or spectrogram, with the ability to process the whole thing or just selected regions. Turns out spectrograms are pretty tricky to implement efficiently, though, so a full-blown editor like that may have to wait a little.

Post by James Perrett »

iansampson wrote: Tue Jan 24, 2023 11:30 pm Out of curiosity, what was it about the Spleeter-based plugins that made them impractical? Was the audio quality not good enough, or is a plugin just not the right tool for that sort of thing?

They were very processor-hungry and weren't very stable. While my computer is fairly old, it is still reasonably powerful compared to many, so I would expect it to be able to run most things - just maybe only a single instance rather than the 2 or 3 that a modern computer could run.

iansampson wrote: Tue Jan 24, 2023 11:30 pm Ultimately, I’d like to move towards a document-based approach, where you open an audio file and see the waveform and/or spectrogram, with the ability to process the whole thing or just selected regions. Turns out spectrograms are pretty tricky to implement efficiently, though, so a full-blown editor like that may have to wait a little.

In my experience, my old copy of Adobe Audition does spectrograms far better than iZotope RX does. RX always redraws the spectrum from the left-hand side, which is very annoying when you are scrolling through a file because you have to wait for the whole screen to redraw before you see the new part. Audition seems smarter because it doesn't seem to redraw the whole spectrum all the time - only the parts that have newly come into view.
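
To make the "only redraw what has newly come into view" idea concrete, here is a minimal sketch in Python. It is not how Audition or RX actually work (that isn't public as far as this thread knows); it just assumes the view shows spectrogram columns [start, start + width) and that columns shared by the old and new views can be copied rather than recomputed. The function name and parameters are made up for illustration.

```python
def newly_visible_columns(old_start: int, new_start: int, width: int) -> range:
    """Return the absolute column indices that must be drawn after a scroll.

    Assumption: the view shows columns [start, start + width). Columns that
    appear in both the old and the new view are reused, not recomputed.
    """
    if width <= 0:
        return range(0)
    shift = new_start - old_start
    if abs(shift) >= width:
        # No overlap between old and new views: everything is new.
        return range(new_start, new_start + width)
    if shift > 0:
        # Scrolled right: new columns appear at the right-hand edge.
        return range(old_start + width, new_start + width)
    if shift < 0:
        # Scrolled left: new columns appear at the left-hand edge.
        return range(new_start, old_start)
    # No movement: nothing to redraw.
    return range(0)


# Scrolling a 100-column view 10 columns to the right only needs 10 new columns.
print(list(newly_visible_columns(0, 10, 100)))
```

The existing pixels would be shifted by the scroll amount and only the returned columns computed and drawn, which is why scrolling can feel instant even when each column needs an FFT.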

It may be worth exploring the Cockos WDL library - they certainly have FFT functions - I don't know if they've included the code they use to do spectral displays in Reaper.

https://www.cockos.com/wdl/
James Perrett
Moderator
Posts: 16341 Joined: Mon Sep 10, 2001 12:00 am Location: The wilds of Hampshire
JRP Music - Audio Mastering and Restoration. JRP Music Facebook Page

Post by iansampson »

Sorry for the hiatus! Had my nose to the grindstone the last couple months getting the app ready for public release. For anyone that’s interested, Luke Wood just published a great write-up about it over in the news section.

James Perrett wrote: Wed Jan 25, 2023 12:21 am
iansampson wrote: Tue Jan 24, 2023 11:30 pm Ultimately, I’d like to move towards a document-based approach, where you open an audio file and see the waveform and/or spectrogram, with the ability to process the whole thing or just selected regions. Turns out spectrograms are pretty tricky to implement efficiently, though, so a full-blown editor like that may have to wait a little.

In my experience, my old copy of Adobe Audition does spectrograms far better than iZotope RX does. RX always redraws the spectrum from the left-hand side, which is very annoying when you are scrolling through a file because you have to wait for the whole screen to redraw before you see the new part. Audition seems smarter because it doesn't seem to redraw the whole spectrum all the time - only the parts that have newly come into view.

As for spectrograms — to revive a two-month-old topic! — I’m still working on it, but taking the Audition-like approach that James recommends here, i.e. redrawing only the parts that come into view. Easier said than done, though, if you want the spectra to follow a non-uniform scale (like human hearing — vs. FFT bins which are linear along the frequency axis) and be resizable, etc. For now, the app is still a batch processor with a simple drag-and-drop interface, but I do hope to get the spectrogram editor working in the near future.
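
For anyone wondering why the non-uniform scale complicates things, here is a rough Python sketch of the mapping involved. It is not the app's actual code. It assumes the mel scale as the "hearing-like" axis (one common choice among several), uses the widely quoted 2595 * log10(1 + f / 700) formula, and all function names are hypothetical. Each display row gets a fractional FFT bin position, and a column of magnitudes is then resampled by linear interpolation.

```python
import math


def hz_to_mel(f: float) -> float:
    # Common mel-scale formula; other perceptual scales would work too.
    return 2595.0 * math.log10(1.0 + f / 700.0)


def mel_to_hz(m: float) -> float:
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)


def row_to_bin_positions(n_rows, n_fft, sample_rate, f_min=0.0, f_max=None):
    """Fractional FFT bin index for each display row, spaced evenly in mel."""
    if f_max is None:
        f_max = sample_rate / 2.0  # Nyquist frequency
    m_lo, m_hi = hz_to_mel(f_min), hz_to_mel(f_max)
    positions = []
    for row in range(n_rows):
        t = row / (n_rows - 1) if n_rows > 1 else 0.0
        hz = mel_to_hz(m_lo + t * (m_hi - m_lo))
        # FFT bin k sits at k * sample_rate / n_fft Hz, so invert that.
        positions.append(hz * n_fft / sample_rate)
    return positions


def resample_column(magnitudes, positions):
    """Linearly interpolate one (non-empty) spectrum column at the given bins."""
    last = len(magnitudes) - 1
    out = []
    for p in positions:
        p = min(max(p, 0.0), float(last))  # clamp to the valid bin range
        lo = int(p)
        hi = min(lo + 1, last)
        frac = p - lo
        out.append((1.0 - frac) * magnitudes[lo] + frac * magnitudes[hi])
    return out


# The row positions depend only on the view height, so they can be cached
# and recomputed only when the window is resized.
positions = row_to_bin_positions(n_rows=5, n_fft=1024, sample_rate=48000)
print(positions)
```

One caveat with this sketch: near the top of the display many FFT bins collapse into a single row, so plain interpolation can skip over narrow peaks. Taking the maximum or average over the bins covered by each row is one way around that.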

Post by resistorman »

Sure could have used this 6 weeks ago :lol: Congrats...
resistorman
Frequent Poster
Posts: 2926 Joined: Sun Nov 22, 2015 12:00 am Location: Asheville NC
"The Best" piece of gear is subjective.

Post by iansampson »

resistorman wrote: Wed Mar 22, 2023 7:46 pm Sure could have used this 6 weeks ago :lol: Congrats...

Haha thanks! If only I’d known, I’d have finished it sooner :p.

Post by Wonks »

Well done, Ian.
Wonks
Jedi Poster
Posts: 18655 Joined: Thu May 29, 2003 12:00 am Location: Reading, UK
Reliably fallible.

Post by iansampson »

Thanks! Really appreciate all the folks who posted earlier in this thread — gave me a much-needed push to finally get this done.