Musical Effect | FOCUS Module(s) |
---|---|
internal glissandos | ACCUMULATE, SUPERACCU |
tonal changes | EXAG, FOCUS, FOLD, SPECFOLD |
sample-hold | FREEZE, HOLD, STEP |
The 'typical command lines' shown below use parameters described in the text and should produce good results, although exactly what happens is dependent on the nature of the input sound. The complex sound used was that of a strum on a 3 ft spring, 1 cm diameter. The pitched sound was a sustained French horn tone.
Return
to Spectral Index
Typical command line:
focus accu raspdt.ana raspdtacc.ana -d0.75 -g-0.9
inanalysisfile input analysis file
outanalysisfile output analysis file
-ddecay sustained channel data decays by a factor of decay each second
(Possible Range: 0.001000 to 1.0. Default is 1.0. Suggested effective Range: 0.000001 to 0.5)-gglis sustained channel data glissandos at glis octaves per second
(Approximate Range: -11.7 to 11.7. Default is 0)
Frequencies are sustained into subsequent windows. The overall effect is one of sustaining, but one which also makes the spectrum more complex.
The gliss parameter appears to change the result the most. It produces glissandos within the spectrum of the sound. Very effective slow glissandos are produced when gliss is near 0, e.g., -0.9 or 0.1. At 0.5, there are several glissandos, at 1, they are fairly fast, and at 10 it becomes a wash.
The range allowed for gliss is really very extreme, especially in the negative range below zero. Values e.g., of -5.0 or less lead to the sound settling down to a steady, sustained pitch after an initial short glisssando. Recommended effective range is from -0.5 to 10.
The delay parameter is important for making the glissandos more perceptible. A low value of e.g., 0.1 is really very dry, whereas something like 0.75 brings out the glissandos nicely, with a kind of reverberant aura.
The main type of output from this function creates a supple, internally modulating sound, moving slowly at the low end of the gliss range and increasingly quickly as one moves towards the high end.
With a clearly pitched input, the results are aurally clearer, but otherwise similar. TIMESTRETCH applied to these results produces nicely slow-moving timbral and pitch changes.
SEE ALSO: SUPERACCU,
HILITE ARPEG.
Churning, bubbly textures can also be achieved with
BLUR SCATTER (high values for
keep) and HILITE TRACE (low
value for channels).
End of FOCUS ACCU
Return to List of functions to FOCUS spectral
features at top of this file
Return
to Main Index for the CDP System.
Return
to Spectral Index
Typical command line:
focus exag raspdt.ana raspdtexg.ana 0.5
inanalysisfile input analysis file
outanalysisfile output analysis file
exaggeration < 1 widens troughs and narrows formants (focuses on the peaks); > 1 narrows troughs and widens formants (diffusing the peaks). Range: 0.00100 to 1000.
exaggeration may vary over time
FOCUS EXAG plays with the frequency focus of regions of high amplitude among the partials of a spectrum. A spectral peak is an area of high amplitude among the frequencies of the spectrum. A trough is an area of low amplitude between spectral peaks. Formants are slightly more than spectral peaks: i.e., they are the area around a spectral peak, and might be narrow, wide etc.).
- Narrowing the formants (< 1) increases the frequency focus of each peak area, and therefore acts like an increasingly tight filter.
- Widening the formants (> 1) reduces the frequency focus of the formants, and therefore makes the sound increasingly fuzzy.
FOCUS EXAG can be used to make a sound thinner and pitched, just as a bandpass filter can create pitches within a complex sound. Working in the spectral domain, it does this by manipulating frequency regions of high amplitude, otherwise known as 'formants'. These can be visualised as little mountain peaks. 'Narrowing' the formants means making the 'mountains' narrower and its sides steeper; furthermore, the material inbetween virtually disappears. 'Widening' the formants means making the mountainsides slope more gradually, therefore encompassing more of the surrounding region. Because this is actually moving the position of partials, this results in timbral change and the addition of noise elements, hence the fuzziness refered to above.
I have found that a value of exag = 0.5 maintains a nice balance between the original and the treated sound. Lower values start to produce results reminiscent of BLUR TRACE, while higher values, e.g., 10, produce quite a fuzzy, noisy result, at least with a complex sound as input. 5 with a pitched sound produced a light buzz sheen on it.
It can also be used in connection with the extraction of a spectral envelope in order to alter the contour shape.
SPECFNU: NARROW FORMANTS (Mode 1)
SEE ALSO: FORMANTS GET,
FORMANTS PUT and
HILITE TRACE.
End of FOCUS EXAG
Return to List of functions to FOCUS spectral
features at top of this file
Return
to Main Index for the CDP System.
Return
to Spectral Index
Typical command line:
focus focus raspdt.ana raspdtfoc.ana -f1 16 0.09 -b20 -t20000
inanalysisfile input analysis file
outanalysisfile output analysis file
-f extract formant envelope linear frequency-wise, using 1 point for every N equally-spaced frequency-channels
-p extract formant envelope linear pitchwise, using N equally-spaced pitch-bands per octave
-i quicksearch for formants (less accurate)
pk (maximum) number of peaks to find Range 1 16
bw bandwidth of peak-centred filters, in octaves
-bbt bottom frequency at which to start the search for peaks
-ttp top frequency at which to end the search for peaks
-sval the number of windows over which the peaks are averaged. This is an attempt to retain only peaks which are STABLE over time. Range: 2 4097. Default is 9.
bw (bandwidth), bt (bottom frequency) & tp (top frequency) may vary over time
The program searches out the most prominent and persistent spectral peaks in a sound, creates a set of filters centred on those peaks, and then filters the sound with those filters. This focuses the spectral energy of the sound around those persistent spectral peaks.
The channels and bandwidth parameters are both the key sensitive areas affecting the outcome. Increasing the number of channels moves the output towards a BLUR TRACE result, while wider bandwidths, as one would expect, 'let through' more of the original sound.
I found that there was a good balance between original and processed sound with these parameter settings: Channels = 1, Peaks = 16, Bandwidth = 0.09, Lofrq = 20 and HiFrq = 20000.
The results depend very much on the spectral configuration of the source, so this function needs to be explored on a 'try it and see' basis. On the whole, it appears to be an alternative approach to BLUR TRACE type effects, i.e., data reduction. However, trials with a pitched sound did produce fairly similar results.
SEE ALSO: HILITE TRACE.
End of FOCUS FOCUS
Return to List of functions which focus
spectral data at top of this file
Return
to Main Index for the CDP System.
Return
to Spectral Index
NB: There is a newer version of this process: SPECFOLD.
Typical command line:
focus fold raspdt.ana raspdtfold.ana 100 900 -x
inanalysisfile input analysis file
outanalysisfile output analysis file
lofrq lowest frequency of range into which the spectrum is folded
hifrq highest frequency of range into which the spectrum is folded
-x fuller spectrum
lofrq and hifrq may vary over time
This process octave transposes all the analysis data of a sound into a specified frequency range. Anything below the range is octave-transposed up until it is in (the bottom of) the range. Anything above gets octave-transposed down until it is in (the top of) the range.
If more than two pieces of transposed information compete for the same channel in the resultant file, the data of the lowest amplitude is thrown away.
The key here is the frequency band: high wide it is and where in the frequency range it is placed. Fairly narrow bands seem the most useful for this function, otherwise too much of the original comes through, so there is not very much change to the sound.
A low-placed range such as 100 to 900 (Hz) produces a deep, muffled sound, with the amount of internal modulation dependent on how much the spectrum of the original input changes. With a rasping spring sound, this produced an 'eerie wind', such as one might imagine a passing spaceship to sound like. On a pitched sound, these parameters resulted in a double pitch: the original and the folded down.
A high-placed range such as 1800 to 3200 yields a fairly high and thin, slightly ringing sound. With the rasping spring sound, this prouced a somewhat ethereal sound. On a pitched sound, these parameters were rather high, and not very much of the input remained.
The fuller spectrum option does seem to strengthen the effect, perhaps bringing through a little more of the original sound.
This process will focus the energy in a tight band while retaining the pitch characteristics (because of the octave-transpose). It is therefore a way to intensify a sound. However there may also be a radical and somewhat unpredictable transformation of the sound, depending on the input. It seems most effective with sounds containing many and complex partials.
SPECFOLD
SEE ALSO: STRANGE INVERT (NB:
seems to overload easily).
End of FOCUS FOLD
Return to List of functions which focus
spectral data at top of this file
Return
to Main Index for the CDP System.
Return
to Spectral Index
Typical command line:
focus freeze 3 raspdt.ana raspdtfrz.ana freeze.txt
(See example datafiles in the text below.)
1 freeze channel amplitudes
2 freeze channel frequencies
3 freeze channel amplitudes & frequencies
inanalysisfile input analysis file
outanalysisfile output analysis file
datafile text file containing times at which the spectrum is frozen. These times may be preceded by character markers:
a use window here as freezewindow for spectrum AFTER this time
b use window here as freezewindow for spectrum BEFORE this time
Otherwise, times are end/start of freeze established at one of these markers.
FOCUS FREEZE freezes the spectrum on one window's frequencies or amplitudes. If the frequencies are frozen, the amplitudes continue to move, and v.vs.: if the amplitudes are frozen, the frequencies continue to move.
The following datafiles illustrate how a (AFTER) and b (BEFORE) work.
marker× (action) a1.0 (freeze AFTER this time) 3.0 (unfreeze) a4.0 (freeze AFTER this time) 5.0 (unfreeze) Here we take the values of the window at 1.0 sec. and freeze them until the 3 sec. mark. Then the window is unfrozen, and the sound continues as normal until 4.0, whereupon it is frozen again with the values at 4.0 sec., until 5.0. (The action comments are of course not included in the file!). Similarly:
marker× (action) 2.0 (no freezing action here) b3.0 (value taken here and applied backwards until 2.0) 4.0 (unfrozen between 3.0 and 4.0) b5.0 (value taken here and applied backwards until 4.0) One can also mix a and b as long as the two operations don't overlap. E.g.,
marker× (action) a2.0 (freezing starts here, value taken from 2.0) 3.0 (unfreeze) 4.0 (freezing starts here, value taken from 5.0) b5.0 (value taken from here and applied backwards until 4.0)
This process prolongs specific timbral configurations in the sound, so it is a way of stopping the continuous spectral evolution which normally takes place in a sound. The results will vary enormously depending on the source and of course the contents of the spectrum at the freeze points. Static and stepped qualities are introduced into the sound by this process.
End of FOCUS FREEZE
Return to List of functions which focus
spectral data at top of this file
Return
to Main Index for the CDP System.
Return
to Spectral Index
Typical command line:
focus hold raspdt.ana raspdthold.ana holds.txt
(See example datafile in the text below.)
inanalysisfile input analysis file
outanalysisfile output analysis file
datafile contains a list of paired times at which the spectrum is held and a hold-duration for each time. These data items must be paired correctly.
The process expands each hold window to the duration given, before proceeding to the next window. The output file is therefore longer than the input file. The timbral quality of the held sections depends on what is happening at the point(s), the window(s), where the holds occur.
This is like SPEC MAGNIFY except that a list of times and hold-durations can be given. The key is to identify the times when the most interesting spectral data is present. Alternatively, you may just want to introduce some additional sustaining. Note that the soundfile plays normally at the times when the holds finish. This means that you can create outputs which mix normal and held sections in various ways.
For example, a useful objective is to alternate the original sound with the held spectrum. This is done in the following file (for a 5 sec. source), where the length of time held (0.6 sec.) is shorter than the time-interval to the next hold point (1 sec.). Thus the sound is 'released' after 1 second and the original sounds for 0.4 second until the next hold point is reached:
Hold_start_time Length_of_time_held 0.5 0.6 1.5 0.6 2.5 0.6 3.5 0.6 4.5 0.4The above alternation of original and held was very noticeable with a complex input such as the spring strum (raspdt.wav). However, with a clear, steady pitch such as a horn sound, there were some light surface changes, but overall a smooth flow between original and held this could therefore be a useful way to extend, i.e., lengthen, a pitched sound, as it adds the held sections to the duration of the soundfile.
End of FOCUS HOLD
Return to List of functions which focus
spectral data at top of this file
Return
to Main Index for the CDP System.
Return
to Spectral Index
Typical command line:
focus step raspdt.ana raspdtstep.ana 0.1
inanalysisfile input analysis file
outanalysisfile output analysis file
timestep duration of the steps. Must be >= the duration of two analysis frames. The value here is rounded internally to a multiple of analysis frame time.
This is an automatic FREEZE of the spectrum at intervals of timestep, similar to the familiar 'sample-hold' effect. The main thing here is that the time intervals are regular. Thus the spectrum of the sound, which normally is constantly changing, is frozen with the values at the beginning of each timestep.
The effect with fairly large timesteps can be a relatively slow-moving sample-hold effect. But as the duration of the timestep was shortened, the complex source developed regular 'jangling' effect, while the clearly pitched source only changed its tone.
The 'jangling' effect seems to occur with timestep at about 0.1 or even 0.05. At 0.25, the sample-hold is quite noticeable, rather slow at 0.5 and probably too slow at 1.0 but this setting could be used to produce sections of steady-state material that can then be CUT and used elsewhere.
Timestep can be very small. The Phase Vocoder reference manual explains that the value shown by DIRSF for the sample rate of an analysis file is actually frame rate: the number of analysis frames per second, e.g. 172 or 344 etc. (The sample rate and value for -N both alter the frame rate.) So if you divide 1 sec by the frame rate, you will learn the duration of one frame: e.g., 1/344 = 0.0029 (2.9ms). As timestep gets larger, timbral alteration increases.
It is recommended that you start exploring this function by using very small values for timestep. This should produce a granulation effect. But note that it is doing this without creating enveloped 'grains' as does MODIFY BRASSAGE. It is working directly on the spectrum of the sound.
End of FOCUS STEP
Return to List of functions which focus
spectral data at top of this file
Return
to Main Index for the CDP System.
Return
to Spectral Index
NB: This is a newer version of FOCUS FOLD.
1 Fold the spectrum.
2 Invert the spectrum.
3 Randomise the spectrum.
inanalfile input analysis file.
outanalfile output analysis file.
stt lowest spectral channel to process (Range: >=1)
len number of channels to process (>= 4); for folding (Mode 1) this must be an even number.
cnt (Mode 1) number of folds (less than len).
seed (Mode 3) seed value sets a specific random permutation of channels. (Range: 1 to 64)
-a process the amplitudes only. Default: process frequencies and amplitudes.
SPECFOLD is a newer alternative to the processes FOCUS FOLD and SPECNU SLICE(Mode 5: Invert spectrum), offering also a randomising of spectral frequencies. For all three modes, the frequency range to work within is set by the lowest channel (stt) and the number of channels len to process above that. (The utility SPECINFO FREQUENCY reports the centre frequency for any given channel, while SPECINFO CHANNEL gives the channel for a given frequency.)
The folding process in Mode 1 is as follows:
- Channels in the fold group are split into two equal sets of consecutive channels.
- Values in the first set are moved from channel N to channel 2N.
- Values in the second set are assigned to remaining channels, in descending order.
- cnt sets the number of fold groups to fit into the frequency range defined by stt and len.
Mode 2 inverts the spectrum, putting high frequency channels into low ones and vice versa, while Mode 3 randomly permutates the channels within the channel range.
Setting the -a flag processes just the amplitudes, not the frequencies; in a typical sound with harmonics reducing in amplitude as frequency rises, this should make the sound brighter. Overall, the channel settings tend to brighten the sound considerably, above the lowest channel (set this higher to preserve the bass).
...
SEPCNU SLICE Mode 5 (invert)
STRANGE INVERT
SPECFNU: RANDOMISE PARTIALS (Mode 19)
End of SPECFOLD
Return to List of functions which focus spectral data at top of this file
Return
to Main Index for the CDP System.
Return
to Spectral Index
Example command line to sustain until louder data is present:
superaccu superaccu 3 rasp.ana raspsuperaccu.ana supaccutune.txt -d0.5 -g0.9
(supaccutune.txt: 60 63 67 70 73) (This example seemed to produce rather too much 'resonant tail' the last 10 seconds were inaudible; the input file was 10 seconds long. So you may need to CUT some of the latter part of the output analysis file with SPEC CUT.)
1 Operates like FOCUS ACCU
2 Forces the (start of) resonances to the tempered scale
3 The frequencies are specified in a 'tuning' file (a 'harmonic set')
4 The frequencies and their octaves are specified in a 'tuning' file (a harmonic field)
inanalysisfile input analysis file
outanalysisfile output analysis file
tuningtextfile text file containing the set of frequencies in (possibly fractional) MIDI values (1-128) for tuning the onsets of the resonance
[-ddecay] sustained channel data attenuates by the factor decay per second. Possible Range: >0.0 to 0.9 Default: 1.0 (no attenuation)
[-gglis] sustained channel data glissandos at glis octaves per second. Approximate Range: -11.7 to 11.7 Default: 0
[-r] reassign the glissandoing pitches to appropriate channels
Superaccu is an extension of FOCUS ACCU, with additional modes to tune the sustained spectral channels to a resonance template. The tuning has a remarkable effect on speech, especially in Mode 2.
For SUPERACCU, you might have a statement saying something like: "Warning: High values of decay can generate very long soundfiles." I tried a Decay rate of .88 with a 6 second speech file, and got a 78 second long ringing after the original talking. Quite lovely, but something to warn folks about. (Warren Burt)
...
End of SUPERACCU
Return to list of FOCUS functions at
top of this file
Return to Main Index for
the CDP System.
Return to Spectral Index