This workshop is designed for people with little experience in using Praat—a cross-platform, open-source phonetics program which assists in the recording, annotation, and analysis of speech sounds. Through the completion of short exercises, participants will be guided through a simple phonetics project from the recording stage to the report-writing stage.
In particular, we will 1) use Praat to record English vowels in /hVd/ environments (e.g. heed, hard, hair’d, horde, who’d), 2) learn to identify and label the vocalic components within the speech signal, 3) extract data of these vowels into tables for analysis, 4) produce relevant graphics both in Praat and a spreadsheet program (e.g. Google Sheets, or Excel), and 5) produce a brief report of the vowels, based on a supplied document template.
No practical experience with acoustic phonetics is assumed, though some basic familiarity of linguistics and speech production is preferred (e.g. completed a LING101 course). Participants should come with a laptop and with Praat downloaded and installed.Download links: MacOS, Windows, Linux
Praat assists us in carrying out phonetic analyses of speech sounds, but just what are phonetic analyses anyway? Roughly stated, they are comparisons of acoustic measurements between two or more groups of speech sounds, and Praat helps us deal with the practicalities of grouping speech sounds of interest (e.g. word-initial vs. word-medial vowels, interrogative vs. declarative sentences), and extracting acoustic measurements of interest (e.g. pitch, intensity, formant structure) for comparison.
The goal of the workshop will be to produce a short report with 4 sections (described below in the side notes). Don’t worry if you don’t understand all of the vocabulary in the report text—you’ll be familiar with them by the end of the workshop!
In this report, we compare first and second formant averages of five vowels reported for Australian-English speakers in Cox (2006Cox, Felicity. 2006. “The Acoustic Characteristics of/hVd/Vowels in the Speech of Some Australian Teenagers.” Australian Journal of Linguistics 26 (2): 147–79.) with averages collected from a participant NS, a 28-year-old male Burmese-English bilingual.11Describe the aim of your analysis and participant information. We recorded five words in the /hVd/ context (heed, hard, horde, who’d, hair’d) using the built-in microphone on a 2016 Macbook Pro in a relatively quiet meeting room at the Australian National University.22Describe items recorded and recording environment.
The vowels were annotated in the Praat Editor Window with only the spectrogram visible (pitch, intensity, and formant contours were turned off), and using the default spectrogram settings (View range: 0-5000 Hz, Window length: 0.005, Dynamic range: 70 dB).33Describe annotation environment and annotation criteria. As illustrated in the figure below, the starts of vowels were placed at the beginning of low frequency components (indicating the end of the voiceless fricative /h/), and the ends of vowels were placed at the termination of high frequency components (signalling the start of the stop closure /d/).
The calculated formant averages were located on a F1-F2 plane along with average values for AusE male and female speakers from Cox (2006Cox, Felicity. 2006. “The Acoustic Characteristics of/hVd/Vowels in the Speech of Some Australian Teenagers.” Australian Journal of Linguistics 26 (2): 147–79.). As observed in the figure below, the acoustic data indicated that NS’s heed vowel was produced in a location effectively identical to the AusE Male average, while their who’d was relatively more backed than both the AusE female and male averages.44Produce vowel plot, and make some observations.
All the theory you need for this workshop can be found in an excerpt from Joe Wolfe’s excellent video “Resonances & Formants” (1m17s to 2m32s):
I provide here a transcript of the excerpt (emphasis mine):
Here’s a simplified demonstration, I have a source with many different frequency components present, and a flexible tube. If I change the geometry of the tube, I change the frequencies of its resonances, and so change the formants.
If I open my mouth more, the tract is a bit more like a cone than a cylinder, so the first resonance rises; […and] conversely [falls], if I close it. So the mouth opening largely determines the first resonance and the formant.
Whether the tongue constriction is forward or back strongly influences the second resonance.
The vowels are traditionally located on a plot of mouth opening versus tongue position, which is almost the same as a plot of first formant versus second, with the axes reversed.
Before we define what a spectrogram is, let’s have a look at the Swiss Alps! The figure below shows, separately, the height of various mountain peaks (e.g. Eigner: 3970 m) and their geographical location on Google Maps.Sources: MySwitzerland.com Grindelwald Skimap and Google Maps on Eiger, Switzerland
One way that we can incorporate altitude information into a map is to use a color gradient to indicate the metres above sea level. For example, we can see a small patch of white in the circle below, indicating the group of mountain peaks well above 3000 m.Map source: Wikimedia Commons
Now, let’s start by looking at two spectral slices, graphs indicating how much sound energy was detected (y-axis) for various frequencies (x-axis) at some given point in time:
Notice above how Slice B has a bigger bump at the start, compared to Slice A? In other words, there’s more acoustic energy at the lower frequencies in Slice B than in Slice A. We see this in the spectrogram as the dark band at the bottom of Slice B, and the relative lightness at the bottom of Slice A. So, just as a terrain map indicates how altitude varies by latitude and longitude, a spectrogram indicates how sound energy varies by frequency and time.
New
’, click Record mono Sound...
which will open the SoundRecorder
in a separate window.Record
to begin recording your vowels and Stop
to finish
Play
to playback the most recent recording, or Record
again to overwrite it.my_vowels
) in the Name
field in the bottom right corner of the SoundRecorder
window, and then press Save to list & Close
. You will see your recording (e.g. ‘1. Sound my_vowels
’) in the list of Objects
.Objects
list (e.g. 1. Sound my_vowels
), or if you had pre-recorded your vowels before the workshop, click Open
then Read from file...
.Annotate -
button then To TextGrid...
, and enter the values below: Objects
list, select both your TextGrid and Sound files (Command + Click for MacOS, Control + Click for Windows), and then click View & Edit
Click the animated image to view the video which you can play/pause to follow it at your own pace.
Praat Picture
window, make a selection (big blue rectangle) where you want to draw the picture.Spectrum -> Draw visible spectrogram...
to draw the spectrogram, and File -> Draw visible TextGrid...
to draw the annotation data
Garnish
adds text such as axis labels, so it’s best to garnish only one of your layers (usually the last)File -> Save as 300-dpi PNG file...
Click the animated image to view the video which you can play/pause to follow it at your own pace.
We’re going to do this step twice. Once manually, and then once semi-automatically with Google Sheets.55If you’re really keen and are familiar with R, you can also do this step in R! By the end of this stage, however you do it, you should have a table as the one shown below. Go to the plot interface to make an F1-F2 plot with these values. Note there are many ways of measuring formant values other than averaging over an interval, but for introductory purposes we’ll stick with the average for this tutorial.
Vowel | F1_average | F2_average |
---|---|---|
heed | 313 | 2332 |
hard | 737 | 1073 |
horde | 367 | 766 |
who’d | 306 | 1249 |
hair’d | 397 | 1913 |
Formant
menu then Show formants
Formant
menu and then Get first formant
. The Praat Info
window will then appear with the mean first formant of within interval, note the value down somewhere (in a text editor, or on a piece of paper).Click the animated image to view the video which you can play/pause to follow it at your own pace.
annotations.csv
and Formant data as formants.csv
.Click the animated image to view the video which you can play/pause to follow it at your own pace.
annotations.csv
and formants.csv
into Google Sheets, ticking the option of Replace current sheet
for the first one, and then Inset new sheet(s)
for the second one.66If you’d prefer to do these next steps in R, here’s the code: https://git.io/vbfQ2.Click the animated image to view the video which you can play/pause to follow it at your own pace.
From the annotations
sheet, query the formants
sheet using the QUERY()
function:
=QUERY(formants!A:C, "select avg(C) where A >= 0.42 and A <= 0.74 label avg(C) ''", 1)
Or, use the &
to concatenate in values from the annotations
sheet:
=QUERY(formants!A:C, "select avg(C) where A >= " & A2 &" and A <= " & D2 & " label avg(C) ''", 1)
Praat Intro: http://www.fon.hum.uva.nl/praat/manual/Intro.html. Note this tutorial is also available from within Praat (Help -> Praat Intro
)
Will Styler’s very comprehensive and regularly updated guidebook Using Praat for Linguistic Research http://savethevowels.org/praat/
Sidney Wood’s Praat for Beginners https://swphonetics.com/praat/introduction/