Made by DATEXIS (Data Science and Text-based Information Systems) at Beuth University of Applied Sciences Berlin
Deep Learning Technology: Sebastian Arnold, Betty van Aken, Paul Grundmann, Felix A. Gers and Alexander Löser. Learning Contextualized Document Representations for Healthcare Answer Retrieval. The Web Conference 2020 (WWW'20)
Funded by The Federal Ministry for Economic Affairs and Energy; Grant: 01MD19013D, Smart-MD Project, Digital Technologies
To determine whether a client presents with puberphonia, a complete voice assessment including medical and diagnostic evaluations is recommended. These assessments are performed by otorhinolaryngologists and speech-language pathologists.
Vocal fold imaging techniques are used by clinicians to examine the vocal folds and allows them to detect vocal pathology and assess the quality of the vocal fold vibrations. Laryngeal stroboscopy is the primary clinical tool used for this purpose. Laryngeal stroboscopy uses a synchronized flashing light passed through either a rigid or flexible laryngoscope to provide an image of the vocal fold motion; the image is created by averaging over several vibratory cycles and is thus not provided in real-time. As this technique relies on periodic vocal fold vibration, it cannot be used in patients with moderate to severe dysphonia. High speed digital imaging of the vocal folds (videokymography), another imaging technique, is not subject to the same limitations as laryngeal stroboscopy. A rigid endoscope is used to take images at a rate of 8000 frames per second, and the image is displayed in real time. As well, this technique allows imaging of aperiodic vibrations and can thus be used with patients presenting with all severities of dysphonia.
Acoustic measures can be used to provide objective measures of vocal function. Signal processing algorithms are applied to voice recordings made during sustained phonation or during spontaneous speech. The acoustic parameters which can then be examined include fundamental frequency, signal amplitude, jitter, shimmer, and noise-to-harmonic ratios. However, due to limitations imposed by the algorithms employed, these measures cannot be used with patients who exhibit severe dysphonia.
A behavioural assessment for puberphonia will consist of several types of tasks, and may include:
- Examining for tension in the neck and throat: The clinician will visually examine the area around the larynx to see if the voice box sits high in the throat, and palpate the area to determine whether there is excessive muscular tension.
- Determining the relationship between tension and vocal pitch: The clinician will ask the client to perform warm-up and relaxation exercises such as those listed in the Treatment section below to determine whether the client has access to their modal voice register.
- Establishing vocal range: The clinician will ask the client to produce the lowest and highest pitch that they can, and perform different speaking or singing activities at various pitches.
- Listening for abnormal traits: The clinician will listen for the presence of breathy voice, an indication of speech in the falsetto register, and other distortions of vocal quality.
- Taking aerodynamic measurements: Many individuals with puberphonia may have limited breath support caused by the thoracic or shallow breathing patterns often used to support speech in the falsetto register. These symptoms are assessed using vocal tasks such as maximum phonation time and direct measures of breath support such as glottal airflow and subglottal pressure.
The most effective diagnostic strategy is to perform laryngoscopy during an episode, at which time abnormal movement of the cords, if present, can be observed. If the endoscopy is not performed during an episode, it is likely that the vocal folds will be moving normally, a 'false negative' finding.
Spirometry may also be useful to establish the diagnosis of VCD when performed during a crisis or after a nasal provocation test. With spirometry, just as the expiratory loop may show flattening or concavity when expiration is affected in asthma, so may the Inspiratory loop show truncation or flattening in VCD. Of course, testing may well be negative when symptoms are absent.
The symptoms of VCD are often inaccurately attributed to asthma, which in turn results in the unnecessary and futile intake of corticosteroids, bronchodilators and leukotriene modifiers, although there are instances of comorbidity of asthma and VCD.
The differential diagnosis for vocal cord dysfunction includes vocal fold swelling from allergy, asthma, or some obstruction of the vocal folds or throat. Anyone suspected of this condition should be evaluated and the vocal folds (voice box) visualized. In individuals who experience a persistent difficulty with inhaling, consideration should be given to a neurological cause such as brain stem compression, cerebral palsy, etc.
The main difference between VCD and asthma is the audible stridor or wheezing that occurs at different stages of the breath cycle: VCD usually causes stridor on the inhalation, while asthma results in wheezing during exhalation. Patients with asthma usually respond to the usual medication and see their symptoms resolve. Clinical measures that can be done to differentiate VCD from asthma include:
- rhinolaryngoscopy: A patient with asthma will have normal vocal cord movement, while one with VCD will display vocal cord abduction during inhalation
- spirometry: A change in the measure following the administration of a bronchodilator is suggestive of asthma rather than VCD
- chest radiography: The presence of hyperinflation and peribronchial thickening are indicative of asthma, as patients with VCD will show normal results.
A number of operations that cut one of the nerves of the vocal folds (the recurrent laryngeal nerve) has improved the voice of many for several months to several years but the improvement may be temporary.
An operation called "selective laryngeal adductor denervation-rennervation (SLAD-R)" is effective specifically for adductor spasmodic dysphonia which has shown good outcomes in about 80% of people at 8 years. Post-surgery voices can be imperfect and about 15% of people have significant difficulties. If symptoms do recur this is typically in the first 12 months. Another operation called "recurrent laryngeal nerve avulsion" has positive outcomes of 80% at three years.
Another surgical option is a thyroplasty, which ultimately changes the position or length of the vocal folds. After thyroplasty there is an increase in both objective and subjective measures of speech.
Hypernasality is generally segmented into so-called 'resonance' effects in vowels and some voiced or sonorant consonants and the effects of excess nasal airflow during those consonants requiring a buildup of oral air pressure, such as stop consonants (as /p/) or sibilants (as /s/). The latter nasal airflow problem is termed 'nasal emission', and acts to prevent the buildup of air pressure and thus prevent the normal production of the consonant. In testing for resonance effects without the aid of technology, speech pathologists are asked to rate the speech by listening to a recorded sentence or paragraph, though there is much variability in such subjective ratings, for at least two reasons. First, the acoustic effect of a given velopharyngeal opening varies greatly depending on the degree of occlusion of the nasal passageways. (This is the reason why a stuffy nose from an allergy or cold will sound more nasal than when the nose is clear.) Secondly, for many persons with hypernasal speech, especially hearing impaired, there are also mispronunciations of the articulation of the vowels. It is extremely difficult to separate the acoustic effects of hypernasality from the acoustic effects of mispronounced vowels (examples). Of course, in speech training of the hearing impaired, there is little possibility of making nasality judgments aurally, and holding a finger to the side of the nose, to feel voice frequency vibration, is sometimes recommended.
Voice therapy appears to be ineffective in cases of true spasmodic dysphonia, however as it is difficult to distinguish between spasmodic dysphonia and functional dysphonias and misdiagnosis is relatively common, a trial of voice therapy is often recommended before more invasive procedures are tried. Some also state that it is useful for mild symptoms and as an add-on to botox therapy and others report success in more severe cases. Laryngeal manual therapy, which is massaging of the neck and cervical structures, also shows positive results for intervention of functional dysphonia.
There are several methods for diagnosing hypernasality.
- A speech therapist listens to and records the child while analysing perceptual speech. In hypernasality, the child cannot produce oral sounds (vowels and consonants) correctly. Only the nasal sounds can be correctly produced. A hearing test is also desirable.
- A mirror is held beneath the nose while the child pronounces the vowels. Nasal air escape, and thus hypernasality, is indicated if the mirror fogs up.
- A pressure-flow technique is used to measure velopharyngeal orifice area during the speech. The patient must be at least three to four years old.
- A video nasopharyngeal endoscopy observes velopharyngeal function, movements of the soft palate, and pharyngeal walls. It utilises a very small scope placed in the back of the nasal cavity. The doctor will then ask the child to say a few words. The patient must be at least three to four years old to ensure cooperation.
- A cinefluoroscopy gives dynamic visualisation and can easier be applied to younger children, though it has the disadvantage of exposing the patient to radiation.
- A nasometer calculates the ratio of nasality. The patient wears a headset, where the oral and nasal cavities are separated by a plate. On both sides of the plate are microphones. The ratio calculated by the nasometer indicates the amount of nasality, with a higher ratio indicating more nasality.
The list of treatments mentioned in various sources for presbylarynx includes the following list. Always seek professional medical advice about any treatment or change in treatment plans.
- Voice therapy
In most cases the cause is unknown. However, there are various known causes of speech impediments, such as "hearing loss, neurological disorders, brain injury, intellectual disability, drug abuse, physical impairments such as cleft lip and palate, and vocal abuse or misuse."
Patulous Eustachian Tube (PET) or tube to open remains an ET dysfunction category, difficult to diagnose and to treat. Recent studies suggest that the pathophysiology and etiology of PET is more related to a previous history of otitis media, rather than from weight loss, which was widely recognized as the major causal factor. Simultaneous weight gain can even be observed in some cases. The average age is 30 years, with a female prevalence. It’s rare to find patients of less than 15 years of age, which seems to confirm the opinion that PET is an acquired condition. Chronic upper respiratory inflammatory diseases are almost always associated with PET; half of these patients report previous history of nasal or sinus surgery. Radical posterior / inferior turbinectomies seem to be proportionally connected to PET severity. History of recurrent otitis media with effusion, with tympanostomy and adenoidectomy during childhood is also frequent. Laryngopharyngeal reflux (LPR) is identified in a great proportion of patients. Last epidemiological data indicates that PET results from obstructive ET dysfunction evolving over a long period of time.
Since the surgical management of these two pathologies is at diametrically opposed extremes, it is critical to obtain the correct diagnosis before undertaking any treatment. Visualization of a permanently open tubal valve by endoscopic examination allows settling this question, but its absence does not exclude PET diagnosis. Sonotubometry and recently developed tubomanometry may help in some specific cases. Unfortunately, some PET cases remain difficult to diagnose.
Classifying speech into normal and disordered is more problematic than it first seems. By a strict classification, only 5% to 10% of the population has a completely normal manner of speaking (with respect to all parameters) and healthy voice; all others suffer from one disorder or another.
There are three different levels of classification when determining the magnitude and type of a speech disorders and the proper treatment or therapy:
1. Sounds the patient can produce
1. Phonemic – can be produced easily; used meaningfully and constructively
2. Phonetic – produced only upon request; not used consistently, meaningfully, or constructively; not used in connected speech
2. Stimulate sounds
1. Easily stimulated
2. Stimulate after demonstration and probing (i.e. with a tongue depressor)
3. Cannot produce the sound
1. Cannot be produced voluntarily
2. No production ever observed
Upon examination of a suspected case of patulous Eustachian tube, a doctor can directly view the tympanic membrane with a light and observe that it vibrates with every breath taken by the patient. A tympanogram may also help with the diagnosis. Patulous Eustachian tube is likely if brisk inspiration causes a significant pressure shift.
Patulous Eustachian tube is frequently misdiagnosed as standard congestion due to the similarity in symptoms and rarity of the disorder. Audiologists are more likely to recognize the disorder, usually with tympanometry or nasally delivered masking noise during a hearing assessment, which is highly sensitive to this condition.
When misdiagnosis occurs, a decongestant medication is sometimes prescribed. This type of medication aggravates the condition, as the Eustachian tube relies on sticky fluids to keep closed and the drying effect of a decongestant would make it even more likely to remain open and cause symptoms. The misdiagnosed patient may also have tubes surgically inserted into the eardrum, which increases the risk of ear infection and will not alleviate patulous Eustachian tube. If these treatments are tried and failed, and the doctor is not aware of the actual condition, the symptoms may even be classified as psychological.
Incidentally, patients who instead suffer from the even rarer condition of superior canal dehiscence are at risk for misdiagnosis of patulous Eustachian tube due to the similar autophony in both conditions.
Bogart–Bacall syndrome (BBS) is a voice disorder that is caused by abuse or overuse of the vocal cords.
People who speak or sing outside their normal vocal range can develop BBS; symptoms are chiefly an unnaturally deep or rough voice, or dysphonia, and vocal fatigue. The people most commonly afflicted are those who speak in a low-pitched voice, particularly if they have poor breath and vocal control. The syndrome can affect both men and women.
In 1988 an article was published, describing a discrete type of vocal dysfunction which results in men sounding like Humphrey Bogart and women sounding like Lauren Bacall. BBS is now the medical term for an ongoing hoarseness that often afflicts actors, singers or TV/radio voice workers who routinely speak in a very low pitch.
Treatment usually involves voice therapy by a speech language pathologist.
In 2006, the U.S. Department of Education indicated that more than 1.4 million students were served in the public schools' special education programs under the speech or language impairment category of IDEA 2004. This estimate does not include children who have speech/language problems secondary to other conditions such as deafness; this means that if all cases of speech or language impairments were included in the estimates, this category of impairment would be the largest. Another source has estimated that communication disorders—a larger category, which also includes hearing disorders—affect one of every 10 people in the United States.
ASHA has cited that 24.1% of children in school in the fall of 2003 received services for speech or language disorders—this amounts to a total of 1,460,583 children between 3 –21 years of age. Again, this estimate does not include children who have speech/language problems secondary to other conditions. Additional ASHA prevalence figures have suggested the following:
- Stuttering affects approximately 4% to 5% of children between the ages of 2 and 4.
- ASHA has indicated that in 2006:
- Almost 69% of SLPs served individuals with fluency problems.
- Almost 29% of SLPs served individuals with voice or resonance disorders.
- Approximately 61% of speech-language pathologists in schools indicated that they served individuals with SLI
- Almost 91% of SLPs in schools indicated that they servedindividuals with phonological/articulation disorder
- Estimates for language difficulty in preschool children range from 2% to 19%.
- Specific Language Impairment (SLI) is extremely common in children, and affects about 7% of the childhood population.
Spacing is important to integrate, especially with self-modeling. Repeated and spaced out use of interventions is shown to be the most helpful long-term for learning. Viewing videotapes of self-modeling should be shown over a spaced out period of time of approximately 6 weeks.
Some practitioners believe there would be evidence indicating anxiolytics to be helpful in treating children and adults with selective mutism, to decrease anxiety levels and thereby speed the process of therapy. Use of medication may end after nine to twelve months, once the person has learned skills to cope with anxiety and has become more comfortable in social situations. Medication is more often used for older children, teenagers, and adults whose anxiety has led to depression and other problems.
Medication, when used, should never be considered the entire treatment for a person with selective mutism. While on medication, the person should be in therapy to help them learn how to handle anxiety and prepare them for life without medication.
Antidepressants have been used in addition to self-modeling and mystery motivation to aid in the learning process.
Dysprosody, which may manifest as pseudo-foreign accent syndrome, refers to a disorder in which one or more of the prosodic functions are either compromised or eliminated completely.
Prosody refers to the variations in melody, intonation, pauses, stresses, intensity, vocal quality, and accents of speech. As a result, prosody has a wide array of functions, including expression on linguistic, attitudinal, pragmatic, affective and personal levels of speech. People diagnosed with dysprosody most commonly experience difficulties in pitch or timing control. Essentially, people diagnosed with the disease can comprehend language and vocalize what they intend to say, however, they are not able to control the way in which the words come out of their mouths. Since dysprosody is the rarest neurological speech disorder discovered, not much is conclusively known or understood about the disorder. The most obvious expression of dysprosody is when a person starts speaking in an accent which is not their own. Speaking in a foreign accent is only one type of dysprosody, as the disease can also manifest itself in other ways, such as changes in pitch, volume, and rhythm of speech. It is still very unclear as to how damage to the brain causes the disruption of prosodic function. The only form of effective treatment developed for dysprosody is speech therapy.
Aphonia is defined as the inability to produce voiced sound. A primary cause of aphonia is bilateral disruption of the recurrent laryngeal nerve, which supplies nearly all the muscles in the larynx. Damage to the nerve may be the result of surgery (e.g., thyroidectomy) or a tumor.
Aphonia means "no sound". In other words, a person with this disorder has lost his/her voice.
Articulation problems resulting from dysarthria are treated by speech language pathologists, using a variety of techniques. Techniques used depend on the effect the dysarthria has on control of the articulators. Traditional treatments target the correction of deficits in rate (of articulation), prosody (appropriate emphasis and inflection, affected e.g. by apraxia of speech, right hemisphere brain damage, etc.), intensity (loudness of the voice, affected e.g. in hypokinetic dysarthrias such as in Parkinson's), resonance (ability to alter the vocal tract and resonating spaces for correct speech sounds) and phonation (control of the vocal folds for appropriate voice quality and valving of the airway). These treatments have usually involved exercises to increase strength and control over articulator muscles (which may be flaccid and weak, or overly tight and difficult to move), and using alternate speaking techniques to increase speaker intelligibility (how well someone's speech is understood by peers). With the speech language pathologist, there are several skills that are important to learn; safe chewing and swallowing techniques, avoiding conversations when feeling tired, repeat words and syllables over and over in order to learn the proper mouth movements, and techniques to deal with the frustration while speaking. Depending on the severity of the dysarthria, another possibility includes learning how to use a computer or flip cards in order to communicate more effectively.
More recent techniques based on the principles of motor learning (PML), such as LSVT (Lee Silverman voice treatment) speech therapy and specifically LSVT may improve voice and speech function in PD. For Parkinson's, aim to retrain speech skills through building new generalised motor programs, and attach great importance to regular practice, through peer/partner support and self-management. Regularity of practice, and when to practice, are the main issues in PML treatments, as they may determine the likelihood of generalization of new motor skills, and therefore how effective a treatment is.
Augmentative and alternative communication (AAC) devices that make coping with a dysarthria easier include speech synthesis and text-based telephones. These allow people who are unintelligible, or may be in the later stages of a progressive illness, to continue to be able to communicate without the need for fully intelligible speech.
Psychogenic aphonia is often seen in patients with underlying psychological problems. Laryngeal examination will usually show bowed vocal folds that fail to adduct to the midline during phonation. However, the vocal folds will adduct when the patient is asked to cough. Treatment should involve consultation and counseling with a speech pathologist and, if necessary, a psychologist.
In this case, the patient's history and the observed unilateral immobility rules out function aphonia.
Students identified with a speech and language disability often qualify for an Individualized Education Plan as well as particular services. These include one-on-one services with a speech and language pathologist. Examples used in a session include reading vocabulary words, identifying particular vowel sounds and then changing the context, noting the difference. School districts in the United States often have speech and language pathologists within a special education staff to work with students. Additionally, school districts can place students with speech and language disabilities in a resource room for individualized instruction. A combination of early intervention and individualized support has shown promise increasing long-term academic achievement with students with this disability.
Students might work individually with a specialist, or with a specialist in a group setting. In some cases, the services provided to these individuals may even be provided in the regular education classroom. Regardless of where these services are provided, most of these students spend small amounts of time in therapy and the large majority of their time in the regular education classroom with their typically developing peers.
Therapy often occurs in small groups of three or four students with similar needs. Meeting either in the office of the speech-language pathologist or in the classroom, sessions may take from 30 minutes to one hour. They may occur several times per week. After introductory conversations, the session is focused on a particular therapeutic activity, such as coordination and strengthening exercises of speech muscles or improving fluency through breathing techniques. These activities may take the form of games, songs, skits, and other activities that deliver the needed therapy. Aids, such as mirrors, tape recorders, and tongue depressors may be utilized to help the children to become aware of their speech sounds and to work toward more natural speech production.
Those who are physically mute may have problems with the parts of the human body required for human speech (the esophagus, vocal cords, lungs, mouth, or tongue, etc.).
Trauma or injury to Broca's area, located in the left inferior frontal cortex of the brain, can cause muteness.