Docsity
Docsity

Prepare for your exams
Prepare for your exams

Study with the several resources on Docsity


Earn points to download
Earn points to download

Earn points by helping other students or get them with a premium plan


Guidelines and tips
Guidelines and tips

Acoustic Measures in Speech and Voice Analysis, Exams of Health sciences

A comprehensive overview of various acoustic measures and concepts used in the analysis of speech and voice. It covers topics such as analog vs. Digital signals, frequency domain analysis, filters, loudness perception, and acoustic measures like jitter, shimmer, and voice range profile (vrp). The document aims to equip the reader with a solid understanding of the fundamental principles and applications of acoustic analysis in the field of speech and voice assessment and treatment. It delves into the importance of using objective, quantitative measures to complement subjective clinical observations, enabling more accurate diagnosis, tracking of progress, and evidence-based practice. The document also highlights the role of acoustic biofeedback in speech therapy, providing real-time visual feedback to patients for improved outcomes.

Typology: Exams

2023/2024

Available from 08/27/2024

Examproff
Examproff 🇺🇸

1

(1)

6K documents

1 / 7

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
COMD 5070 Exam 1 -Answered
3D spectrogram features - ANSWER-X axis =time /Y axis= frequency /darkness
=intensity
4 features of the scientific method - ANSWER-1) Science is empirical- we collect data
and interpret it (based on data)
2) Science is Deterministic - Things are not random. Because of cause and effect, we
can draw generalizations from the data that we have and extrapolate
3) Science is predictive - if you do this, then that will happen.
4) Science is parsimonious - use the simplest explanation possible, try to describe
things in economical terms so we can understand it. Use the simplest terms possible,
but no simpler.
A person's ability to go from soft to loud varies depending : - ANSWER-On pitch
Aliasing - ANSWER-When sampling too slowly, you will inaccurately record the original
signal. High, original frequencies will be improperly recorded as lower frequencies.
Think of the sine wave, where only certain dots were captured, so the original waveform
is "smoothed out" or aliased.
Analog - ANSWER-Any change in air pressure results in commensurate change in
electrical signal. Analogous to a thermometer, as temperature increases, mercury rises.
Electrical signal through the wire from the microphone is an analog representation of the
sound waveform that the microphone picked up. Nature of signal precisely reflects the
nature of the sound wave that arrived.
at high intensity, - ANSWER-the sounds seem to be equal in loudness to intensity.
Band pass filter - ANSWER-Removes high and low frequencies but lets through the
frequencies in the middle
Band reject filter - ANSWER-Allows all frequencies except for a specific band in the
middle.
Bit resolution - ANSWER-number of "shades of gray" to represent the number of
amplitude values. Bit resolution is represented by the exponent 2 squared - 2 bit
resolution, 2 to the 16th - 16 bit resolution.
Common analog devices - ANSWER-clocks, watches, thermometers, tape measure,
dimmer switch, tape recorder.
Common digital devices - ANSWER-digital clocks, computer, CD, MP3 player. - things
that work with 1's and 0's.
pf3
pf4
pf5

Partial preview of the text

Download Acoustic Measures in Speech and Voice Analysis and more Exams Health sciences in PDF only on Docsity!

COMD 5070 Exam 1 -Answered

3D spectrogram features - ANSWER-X axis =time /Y axis= frequency /darkness =intensity 4 features of the scientific method - ANSWER-1) Science is empirical- we collect data and interpret it (based on data)

  1. Science is Deterministic - Things are not random. Because of cause and effect, we can draw generalizations from the data that we have and extrapolate
  2. Science is predictive - if you do this, then that will happen.
  3. Science is parsimonious - use the simplest explanation possible, try to describe things in economical terms so we can understand it. Use the simplest terms possible, but no simpler. A person's ability to go from soft to loud varies depending : - ANSWER-On pitch Aliasing - ANSWER-When sampling too slowly, you will inaccurately record the original signal. High, original frequencies will be improperly recorded as lower frequencies. Think of the sine wave, where only certain dots were captured, so the original waveform is "smoothed out" or aliased. Analog - ANSWER-Any change in air pressure results in commensurate change in electrical signal. Analogous to a thermometer, as temperature increases, mercury rises. Electrical signal through the wire from the microphone is an analog representation of the sound waveform that the microphone picked up. Nature of signal precisely reflects the nature of the sound wave that arrived. at high intensity, - ANSWER-the sounds seem to be equal in loudness to intensity. Band pass filter - ANSWER-Removes high and low frequencies but lets through the frequencies in the middle Band reject filter - ANSWER-Allows all frequencies except for a specific band in the middle. Bit resolution - ANSWER-number of "shades of gray" to represent the number of amplitude values. Bit resolution is represented by the exponent 2 squared - 2 bit resolution, 2 to the 16th - 16 bit resolution. Common analog devices - ANSWER-clocks, watches, thermometers, tape measure, dimmer switch, tape recorder. Common digital devices - ANSWER-digital clocks, computer, CD, MP3 player. - things that work with 1's and 0's.

Digital signal - ANSWER-the signal is digital, not the sound, your ears don't hear digits. Digital storage format is more convenient and clearer electroglottograph - ANSWER-Has 2 electrodes that are placed on either side of the larynx directly on the lamina of the thyroid cartilage. A current passes from one to the other through the vocal folds. Equal loudness contours - ANSWER-Created by presenting a known frequency (1, Hz) and then other tones are played and the listener has to adjust the loudness until it sounds the same as the 1,000 Hz tone FFT spectrum - ANSWER-Fast fourier transform - clearly shows range of harmonics that are present in a complex sound. Shows source but not filter ( formants - articulators) - individual peaks for the harmonics Fourier transform - ANSWER-a way to analyze complex sounds by separating them into their components Fourier transform is similar to - ANSWER-light through a spectrum, it separates out the individual colors, like a FT separates out the individual sine waves. So you can now hear all of the individual sounds that make up the harmonic series - this is the spectrum / frequency/domain display. Similar to a list of ingredients of a cake. Frequency - ANSWER-how frequently a waveform repeats. Measured in Hz (cycles per second). Within the same window length, a higher frequency sound wave will show more repetitions than a lower frequency sound wave Frequency Difference limen - ANSWER-smallest change in frequency that is detectable by a listener. As frequency increases, the DL becomes much greater Frequency domain display - ANSWER-is a line spectrum that shows the frequency components of a periodic sound. Frequency is on the X axis and amplitude is on the Y. general rule is that a 6-10 dB increase - ANSWER-is generally perceived as double as loud Graphic representation; - ANSWER-join the dots: the more dots, the better the image, the fewer, the cruder Greatest sensitivity to loudness is between - ANSWER-1,000 - 5,000 Hz, threshold is much higher at very low or very high frequencies Harmonic Series - ANSWER-has harmonics that are multiples of the fundamental and are complex, periodic waveforms which are more pleasing to our ears. High pass filter - ANSWER-allows high frequencies and attenuates low frequencies

increase in su-glottic pressure results in - ANSWER-more forceful collisions of the vocal folds, increased pressure on articulators, pressure in the oral cavity increases, which requires stronger closure of articulators to manage the increased pressure Inharmonic Series - ANSWER-has components that are not quite multiples which are not periodic or predictable and they are not as pleasing Intensity - ANSWER-amplitude or size of a sound.Adjusting volume, is adjusting the physical intensity of sound. Intensity Difference limen - ANSWER-as intensity decreases, Dls become larger Inverse square law - ANSWER-describes how intensity diminishes as it gets further away from a sound source - intensity diminishes in proportion to the square of the distance from the source. If it's 2 units away, it diminishes by x/2 squared (1/4), 3 units: x/3 squared or 1/9 ) it decreases in strength - ANSWER-when they are separate Jitter - ANSWER-Random, rapid cycle to cycle duration - frequency perturbation - fundamental frequeny jumps around large negative voltage - ANSWER-rarefaction Large positive voltage - ANSWER-air compression Laryngeal mirror - ANSWER-Simple tool a dr. might use to view the larynx limitations of acoustic measures - ANSWER-motor equivalence- you can achieve the same sound in several slightly different ways. Loud speech requires - ANSWER-increased subgolttal pressure, more driving pressure for loud speech, larger vocal fold excursions, more forceful vocal fold collisions, larger articulator movements, higher oral pressure, consonant burst release is stronger Loudness - ANSWER-Perceptual characteristic of sound. Listener can judge loudness, there is no equipment that can measure loudness. Human reaction to the amplitude of a sound. Can be measured with a psycho-physical scale. Low pass filter - ANSWER-allows low frequencies and attenuates high frequencies LPC Spectrum - ANSWER-Linear predictive coding - shows spectral envelope. Doesn't reveal much about the sound source or harmonics, but reveals a lot about the formants. You just see peaks that show the formants - what defines one phoneme from another Modulation sounds like - ANSWER-shaky, unsteady, vibrato.

Modulation tremor - ANSWER-rhythmic modulation in frequency and amplitude. Much slower than random cycle to cycle changes in perturbation. Pattern of gradual increase/decrease in frequency or amplitude is spread across many cycles. Narrow band spectrogram - ANSWER-gives clear frequency detail but poor time detail - shows harmonic details in horizontal bands Nyquist frequency - ANSWER-half the sample rate. You determine the highest frequency you want to include in your recording and double it, this will ensure that the playback will go at least as high as the original signal. Nyquist determines the highest frequency you can reproduce in playback. CD sampling rate is 44,100 and it store signals up to 22,050 Hz. Octave - ANSWER-a doubling or halving of the frequency. Up one octave, you double, down one octave, you half the frequency. Peturbation is measured during - ANSWER-sustained phonation Peturbation sounds like - ANSWER-hoarseness, it is random and rapid. Pitch - ANSWER-the subjective measure of frequency; you cannot measure pitch with an instrument. You can have people match pitch perception to frequencies. Possible causes of peturbation - ANSWER-minor irregularities in neuron firing, Air flow may also play a role Quantifying tremor - ANSWER-How rapid is it? How extreme is it? How steady is it? quantization - ANSWER-A number represents signal amplitude, one number per snapshot. Quantization restricts the - ANSWER-amplitude: More decimal places, more level of detail, more available amplitudes, simpler number makes a cruder record of the original sound because there are less choices. Stair step graph. RMS amplitude - ANSWER-A way of calculating the arithmetic mean without having the negative values cancel out the positive. First you square all values, making them positive. Then you take the average of the squared values and then you take the square root. Works for sine waves, and complex waveforms, such as aperiodic sounds. Sample size restricts the - ANSWER-Frequency Sampling rate - ANSWER-The number of snapshots of the sound in time. ( in Hz) 10 samples is a 10 Hz recording

Why do SLPs use technology - ANSWER-1) Overcome listener bias - computer isn't influenced by what it has heard before. It is a consistent, reliable measure.

  1. Describe severity objectively - standardizes measures of speech
  2. Track progress over time - helps demonstrate treatment efficacy
  3. Helps to comply with ASHA's focus on EBP - provides hard data to measure rather than just gut feelings about progress based on experience.
  4. Biofeedback- computer display in real time - ex: visipitch shows intonations on screen in real time, shows them progress as they speak, better than playing back a recording. Why use a logarithmic scale for intensity? - ANSWER-This is necessary because the highest amplitude sound we can perceive is 1 trillion times the softest sound we can perceive (120 dB). Wide band spectrogram - ANSWER-gives clear time detail, poor frequency resolution - shows glottal pulses in vertical striations.