FluidSynth 1.0 — Developer Documentation
Peter
Hanappe
1.0
2003-12-11
hanappe
First attempt.
2003
Copyright Peter Hanappe
All the source code examples in this document are in the
public domain; you can use them as you please. This document is
licensed under the Creative Commons Attribution License. To view
a copy of this license, visit
http://creativecommons.org/licenses/by/1.0/ or send a letter to
Creative Commons, 559 Nathan Abbott Way, Stanford, California
94305, USA. The FluidSynth library is distributed under the GNU
Library General Public License. A copy of the GNU Library
General Public License is contained in the FluidSynth package;
if not, write to the Free Foundation, Inc., 59 Temple Place,
Suite 330, Boston, MA 02111-1307 USA.
FluidSynth
software
synthesizer
SoundFont
Linux
audio
development
documentation
HOWTO
FluidSynth is a software synthesizer based on the
SoundFont 2 specifications. The synthesizer is available as a
shared object that can easily be reused in any application that
wants to use wavetable synthesis. This documents explains the
basic usage of FluidSynth. Some of the more advanced features
are not yet discussed but will be added in future
versions.
Disclaimer
This documentation, in its current version, is probably
outdated and most certainly incomplete. As always, the source code
is the final reference.
SoundFont(R) is a registered trademark of E-mu Systems, Inc.
Introduction
FluidSynth can easily be embedded in an application. It has
a main header file, fluidsynth.h, and one dynamically linkable
library. FluidSynth runs on Linux, MacOS 9, MacOS X, and the Win32
platforms. It has audio and midi drivers for all mentioned
platforms but you can use it with your own drivers if your
application already handles audio and MIDI input/output. This
document explains the basic usage of FluidSynth and provides
examples that you can reuse.
Creating and changing the settings
Before you can use the synthesizer, you have to create a settings
object. The settings objects is used by all components of the
FluidSynth library. It gives a unified API to set the parameters
of the audio drivers, the midi drivers, the synthesizer,
andsoforth. A number of default settings are defined by the
current implementation. In future versions, the use of the
settings will probably be generalized.
All settings have a name that follows the "dotted-name"
notation. For example, "synth.polyphony" refers to the number of
voices (polyphony) preallocated by the synthesizer. The settings
also have a type. There are currently three types: strings,
numbers (double floats), and integers. You can change the values
of a setting using the fluid_settings_setstr,
fluid_settings_setnum, and
fluid_settings_setint functions. For example:
#include <fluidsynth.h>
int main(int argc, char** argv)
{
fluid_settings_t* settings = new_fluid_settings();
fluid_settings_setint(settings, "synth.polyphony", 128);
delete_fluid_settings(settings);
return 0;
}
The API contains the functions to query the type, the current
value, the default value, the range and the "hints" of a
setting. The range is the minumum and maximum value of the
setting. The hints gives additional information about a
setting. For example, whether a string represents a filename. Or
whether a number should be interpreted on on a logarithmic
scale. Check the API documentation for a description of all
functions.
Creating the synthesizer
To create the synthesizer, you pass it the settings object, as
in the following example:
#include <fluidsynth.h>
int main(int argc, char** argv)
{
fluid_settings_t* settings;
fluid_synth_t* synth;
fluid_settings_t* settings = new_fluid_settings();
synth = new_fluid_synth(settings);
/* Do useful things here */
delete_fluid_synth(synth);
delete_fluid_settings(settings);
return 0;
}
The default settings should be fine for most uses. A detailed
description of all the settings used by the synthesizer described
below.
Synthesizer settings
synth.gain
Type
number
Default
0.2
Min-Max
0.0-10.0
Description
The gain is applied to the final or master output of the
synthesizer. It is set to a low value by default to avoid the
saturation of the output when random MIDI files are played.
synth.sample-rate
Type
number
Default
44100
Min-Max
22050-96000
Description
The sample rate of the audio generated by the synthesizer.
synth.polyphony
Type
integer
Default
256
Min-Max
16-4096
Description
The polyphony defines how many voices can be played in parallel. The
number of voices is not necessarily equivalent to the number of notes
played simultaniously. Indeed, when a note is struck on a specific
MIDI channel, the preset on that channel may created several voices,
for example, one for the left audio channel and one for the right
audio channels. The number of voices activated depends on the number
of instrument zones that fall in the correspond to the velocity and
key of the played note.
synth.midi-channels
Type
integer
Default
16
Min-Max
16-256
Description
This setting defines the number of MIDI channels of the
synthesizer. The MIDI standard defines 16 channels, so most hardware
keyboards are limited to 16. If you plan to use the synthesizer as a
plugin in an application, it might be interesting to set the number of
channels to a larger value. In this case you can program a greater
number of presets.
synth.reverb.active
Type
string
Default
"yes"
Description
When set to "yes" the reverb effects module is activated. Otherwise,
no reverb will be added to the output signal. Note that when the
reverb module is active, the amount of signal send to the reverb
module depends on the "reverb send" generator defined in the
SoundFont.
synth.chorus.active
Type
string
Default
"yes"
Description
When set to "yes" the chorus effects module is activated. Otherwise,
no chorus will be added to the output signal. Note that when the
reverb module is active, the amount of signal send to the chorus
module depends on the "chorus send" generator defined in the
SoundFont.
synth.ladspa.active
Type
string
Default
"no"
Description
When set to "yes" the LADSPA subsystem will be called. This subsystem
allows to load and interconnect LADSPA plugins. The output of the
synthesizer is processed by the LADSPA subsystem. Note that the
synthesizer has to be compiled with LADSPA support. More information
about the LADSPA subsystem later.
synth.audio-groups
Type
integer
Default
1
Min-Max
1-128
Description
By default, the synthesizer outputs a single stereo signal. Using this
option, the synthesizer can output multichannel audio.
synth.effects-channels
Type
integer
Default
2
Min-Max
2-2
Description
synth.verbose
Type
string
Default
"no"
Description
When set to "yes" the synthesizer will print out information
about the received MIDI events to the stdout. This can be helpful
for debugging. This setting can not be changed after the synthesizer
has started.
synth.dump
Type
string
Default
"no"
Description
Creating the audio driver
The synthesizer itself does not write any audio to the audio
output. This allows application developers to manage the audio
output themselves if they wish. The next section describes the use
of the synthesizer without an audio driver in more detail.
Creating the audio driver is straightforward: set the appropriate
settings and create the driver object. Because the FluidSynth has
support for several audio systems, you may want to change which
one you want to use. The list below shows theaudio systems that
are currently supported. It displays the name, as used by the
fluidsynth library, and a description.
alsa: Advanced Linux Sound Architecture
oss: Open Sound System (Linux)
jack: JACK Audio Connection Kit (Linux, Mac OS X)
portaudio: Portaudio Library (MacOS 9 & X, Windows, Linux)
sndmgr: Apple SoundManager (Mac OS Classic)
coreaudio: Apple CoreAudio (MacOS X, experimental)
dsound: Microsoft DirectSound (Windows)
The default audio driver depends on the settings with which
FluidSynth was compiled. You can get the default driver with
fluid_settings_getstr_default(settings, "audio.driver"). To get
the list of available drivers use the
fluid_settings_foreach_option
function. Finally, you can set the driver with
fluid_settings_setstr. In most cases, the
default driver should work out of the box.
Additional options that define the audio quality and latency are
"audio.sample-format", "audio.period-size", and
"audio.periods". The details are described later.
You create the audio driver with the
new_fluid_audio_driver function. This
function takes the settings and synthesizer object as
arguments. For example:
void init()
{
fluid_settings_t* settings;
fluid_synth_t* synth;
fluid_audio_driver_t* adriver;
settings = new_fluid_settings();
/* Set the synthesizer settings, if necessary */
synth = new_fluid_synth(settings);
fluid_settings_setstr(settings, "audio.driver", "jack");
adriver = new_fluid_audio_driver(settings, synth);
}
As soon as the audio driver is created, it will start playing. The
audio driver creates a separate thread that runs in real-time mode
(is the application has sufficient privileges) and call the
synthesizer object to generate the audio.
There are a number of general audio driver settings. The
audio.driver settings defines the audio subsystem that will be
used. The audio.periods and audio.period-size settings define the
latency and robustness against scheduling delays. There are
additional settings for the audio subsystems used. They will be
documented later.
General audio driver settings
audio.driver
Type
string
Default
alsa (Linux), dsound (Windows), sndman (MacOS9), coreaudio (MacOS X)
Options
alsa, oss, jack, dsound, sndman, coreaudio, portaudio
Description
The audio system to be used.
audio.periods
Type
int
Default
16 (Linux, MacOS X), 8 (Windows)
Min-Max
2-64
Description
The number of the audio buffers used by the driver. This
number of buffers, multiplied by the buffer size (see setting
audio.period-size), determines the maximum latency of the audio
driver.
audio.period-size
Type
int
Default
64 (Linux, MacOS X), 512 (Windows)
Min-Max
64-8192
Description
The size of the audio buffers (in frames).
audio.sample-format
Type
string
Default
"16bits"
Options
"16bits", "float"
Description
The format of the audio samples. This is currently only an
indication; the audio driver may ignore this setting if it can't
handle the specified format.
Using the synthesizer without an audio driver
It is possible to use the synthesizer object without creating an
audio driver. This is desirable if the application using
FluidSynth manages the audio output itself. The synthesizer has
several API functions that can be used to obtain the audio output:
fluid_synth_write_s16 fills two buffers (left
and right channel) with samples coded as signed 16 bits (the
endian-ness is machine
dependent). fluid_synth_write_float fills a
left and right audio buffer with 32 bits floating point
samples. For multi channel audio output, the function
fluid_synth_nwrite_float has to be used.
The function fluid_synth_process is still
experimental and its use is therefore not recommended but it will
probably become the generic interface in future versions.
Loading and managing SoundFonts
Before any sound can be produced, the synthesizer needs a
SoundFont. For a discussion on SoundFont please refer to some
other, not yet existing, therefore virtual document.
SoundFonts are loaded with the
fluid_synth_sfload function. The function
takes the path to a SoundFont file as argument and a boolean to
indicate whether the presets of the MIDI channels should be
updated after the SoundFont is loaded. More on the preset updates
below.
The synthesizer can load any number of SoundFonts. This is an
advantage, of course, but there are some issues that you must be
aware of. The presets in a SoundFont are identified by their bank
and preset number. The MIDI specifications allows the change of a
preset on a MIDI channel using the combination of "bank select"
and the "program change" messages. An ambiguity arrizes when a
preset with a specific bank and preset number is defined in
multiple loaded SoundFonts. This is solved by searching the
SoundFonts in the inverse order they were loaded, i.e. the lastly
loaded SoundFont is searched first for the request preset
(identified by bank and preset number) then the on but last loaded
SoundFont, and so on until. The first preset found is then
used. You can somehow consider the SoundFonts placed on a
stack. The SoundFont on top of the stack is inspected first,
followed by the SoundFont down on the stack. Newly loaded
SoundFonts are always placed on top of the stack. This is how
commercial, hardware synthesizers work. The inconvenience is that
a preset in a SoundFont at the bottom end of the stack may be
masked by a preset in a SoundFont at the top of the stack. Using
the standard MIDI messages, bank select and program change, there
is no way to select a masked preset. However, FluidSynth has an
API function to unambiguously select a preset
(fluid_synth_program_select). This function
is not invokeable through MIDI messages, though.
The fluid_synth_sfload function returns the
unique identifier of the loaded SoundFont, or -1 in case of an
error. This identifier is used in subsequent management functions:
fluid_synth_sfunload removes the SoundFont,
fluid_synth_sfreload reloads the
SoundFont. When a SoundFont is reloaded, it retains it's ID and
position on the SoundFont stack.
Additional API functions are provided to get the number of loaded
SoundFonts ot to get a pointer to the SoundFont.
Another issue that needs some explanation is the reprogramming of
the presets after a SoundFont load or unload. The default behavior
of commercial synthesizers is to reset all the preset that are
programmed on the MIDI channels when a SoundFont is loaded or
unloaded. Consider the case where MIDI channel 1 uses preset (0,
0) (the couple indicates the bank and program number). This preset
was found in the SoundFont with ID 3, for example. When a new
SoundFont is loaded that also contains a preset with bank number 0
and program number 0, then the newly loaded preset will be used on
channel 1 for future events. This behavior is as if a bank select
and program change message is send to all channels after a
load/unload using the channel's bank and program number. This may
be sometimes confusing or unwanted. A user may not want to loose
its preset setup when a new SoundFont is loaded. To avoid the
reprogramming of the presets, the third parameter to the
fluid_synth_sfload and
fluid_synth_sfunload functions should be set
to zero.
Sending MIDI events
Once the synthesizer is up and running and a SoundFont is loaded,
most people will want to do something usefull with it. Make noise,
for example. The synthesizer aims to be compatible with the MIDI
standard, so it accepts almost all MIDI messages (details on the
MIDI compatibility elsewhere). The MIDI channel messages can be
send using the fluid_synth_noteon,
fluid_synth_noteoff,
fluid_synth_cc,
fluid_synth_pitch_bend,
fluid_synth_pitch_wheel_sens, and
fluid_synth_program_change functions. For
convenience, there's also a
fluid_synth_bank_select function (the bank
select message is normally sent using a control change message).
The following example show a generic graphical button that plays a
not when clicked:
class SoundButton : public SomeButton
{
public:
SoundButton() : SomeButton() {
if (!_synth) {
initSynth();
}
}
static void initSynth() {
_settings = new_fluid_settings();
_synth = new_fluid_synth(_settings);
_adriver = new_fluid_audio_driver(_settings, _synth);
}
/* ... */
virtual int handleMouseDown(int x, int y) {
/* Play a note on key 60 with velocity 100 on MIDI channel 0 */
fluid_synth_noteon(_synth, 0, 60, 100);
}
virtual int handleMouseUp(int x, int y) {
/* Release the note on key 60 */
fluid_synth_noteoff(_synth, 0, 60);
}
protected:
static fluid_settings_t* _settings;
static fluid_synth_t* _synth;
static fluid_audio_driver_t* _adriver;
};
Advanced features, not yet documented
Accessing low-level voice parameters
Reverb settings
Chorus settings
Interpolation settings (set_gen, get_gen, NRPN)
Sequencer
LADSPA effects unit
MIDI router
Multi-channel audio
MIDI tunings
MIDI file player
SoundFont loader