Cleaning Up Noisy Recordings
Trevor Wishart (July 2007)
Introduction
The 'Cleaning Kit' was originally developed to clean up vocal recordings
made in noisy environments.
Typical sources of noise might be ...
- Relatively Constant (e.g. extractor fan or air-conditioning
hum)
- Sustained but Changing (e.g. a passing motor vehicle)
- Intermittent (creaks of furniture, clothing noise)
- Intermittent but Complex (e.g. other people talking)
The Cleaning Kit uses some existing and some new CDP programs to tackle all
of these situations. However, it is not possible to deal with all possible noise
intrusions.
At present, the Cleaning Kit handles only mono files. To clean
stereo or multichannel files you will need to split the source into its
constituent channels and clean each channel separately.
The Cleaning Kit Framework
The Main Cleaning Kit window presents
- A list of Cleaning Tools to apply to
your source.
- A soundfiles window for listing soundfiles, where your file to be
cleaned will appear.
Typically, cleaning a source will involved several
passes. As you proceed, the soundfiles generated at each cleaning stage will
be listed in the window. You can select (by clicking on a file in the window)
which of these sounds you wish to clean (e.g. you can go back to the output of
the penultiamte cleaning procedure, if the last cleaning operation proved
unsatisfactory).
The window defaults to chosing the output of the last
cleaning operation.
- An output filename box, where you can choose a name for the final,
completely cleaned sound.
Often you will wish to overwrite the original
source, so there is a button to put the name of the source file in the
output-filename box.
There are currently 13 options for cleaning sources.
You can call any one of these
options br pressing the button adjacent to its name.
When a tool is called, a
new window will appear and you will be asked to enter relevant parameters for
that Cleaning process.
The 13 Cleaning options are
- Clean General Noise
This would typically be used to
remove constant background sounds like air-conditioning or
extractor-fan hum.
- Gate out Bad Signal
This replaces unwanted sound by silence
of the same duration. This can be used, for example
- To remove intakes of breath, or unwanted glossalalia.
- To remove any residual 'flutterings' from the Clean General Noise
process.
- To remove low level, but complex, background sounds, like speech, where
the principal source pauses (even momentarily). Particularly useful in
situations where the foreground sound you want is loud enough to mask the
background sounds most of the time, except when it pauses.
- Cut away Bad Signal
This can be used, for example
- To remove loud, short, intrusive sounds (from door-slams to clicks and
clinks).
- To remove hesitations.
- Reinsert Original Signal
- Remove Pitch in Signal
This is a spectral subtraction
process, and can be used, for example
- To remove clearly pitched electronic tones from gadgets used in the
environment.
- To remove residual pitches left behind by the Subtract Spectrum
or Clean General Noise options.
- To remove an interrupting voice, where both the principal speaker, and
the interrupting speaker, are speaking vowels, and each has a clear but
different pitch. This sometimes works perfectly and at other times fails
completely!
- Remove Sound under Sibilant
This is a filtering process, and
can be used to remove intrusive sounds where the foreground speaker is using a
sibilant
Typical intrusions might be
- the hum of passing traffic.
- a 2nd voice using a vowel.
- Remove Noise Above Signal
This is a filtering process, and
can be used to remove noise type intrusions
Typical intrusions might
be
- clothing noise.
- a 2nd voice using a sibilant.
This often causes some degradation of the principal source at the point where
the filtering is done.
It is usually better to use spectral subtraction if
this is possible at the particular time in the source.
- Low Frequency Problem
This is a filtering process, and can
be used to remove low frequency intrusions
Typical intrusions might be
- Footfalls or falling objects on carpeted floors.
- physical knocks on microphones or stands.
- wind/breath noise on a microphone.
- Subtract Spectrum
This is typically useful where a noise
intrusion begins, ends, or overlays a time where the foreground source is
silent, but then continues over the foreground source. You are asked to
specify the place where the intrusion is not overlaying the principal
source as well as the whole area to be cleaned. The process works by
subtracting the former from the latter. It can sometimes be used instead of
filtering or gating.
- Graft segment Elsewhere
This is a last resort process, where
nothing can be done to remove a noise intrusion. It allows you, for example,
to cut a sliver of a sybillant from one part of the recording, and use that to
replace a noise-intruded sibilant elsewhere.
- Insert Silence
Cutting away bad signal can sometimes result
in unnatural speech (or other) rhythms. These can sometimes be mitigated by
inserting an appropriate durations of silence.
- Attenuate Level
This can be used, for example, to compensate
for the source's back and forth movement in front of the microphone.
- Dovetail the Ends
This can be useful where a section of
recording has been cut from a longer source. For example, a single voice is
extracted from a conversation and the 2 speakers overlap at the boundary but,
to preserve the sense of the speech, the overlapped segment has to be
retained. Using other processes (pitch or sibilant removal, see above) may
still leave some residual unwanted signal. This can sometimes be hidden by
subtlely dovetailing the start (or end) of the recording.