Searched for: in-biosketch:yes
person:azadpm01
Employing deep learning model to evaluate speech information in acoustic simulations of Cochlear implants
Sinha, Rahul; Azadpour, Mahan
Acoustic vocoders play a key role in simulating the speech information available to cochlear implant (CI) users. Traditionally, the intelligibility of vocoder CI simulations is assessed through speech recognition experiments with normally-hearing subjects, a process that can be time-consuming, costly, and subject to individual variability. As an alternative approach, we utilized an advanced deep learning speech recognition model to investigate the intelligibility of CI simulations. We evaluated model's performance on vocoder-processed words and sentences with varying vocoder parameters. The number of vocoder bands, frequency range, and envelope dynamic range were adjusted to simulate sound processing settings in CI devices. Additionally, we manipulated the low-cutoff frequency and intensity quantization of vocoder envelopes to simulate psychophysical temporal and intensity resolutions in CI patients. The results were evaluated within the context of the audio analysis performed in the model. Interestingly, the deep learning model, despite not being originally designed to mimic human speech processing, exhibited a human-like response to alterations in vocoder parameters, resembling existing human subject results. This approach offers significant time and cost savings compared to testing human subjects, and eliminates learning and fatigue effects during testing. Our findings demonstrate the potential of speech recognition models in facilitating auditory research.
PMCID:11479273
PMID: 39402071
ISSN: 2045-2322
CID: 5711602
Current status of pediatric auditory brainstem implantation in inner ear malformations; consensus statement of the Third International Pediatric ABI Meeting
Sennaroglu, Levent; Lenarz, Thomas; Roland, J Thomas; Lee, Daniel J; Colletti, Liliana; Behr, Robert; Jiang, Dan; Saeed, Shakeel R; Casselman, Jan; Manrique, Manuel; Diamante, Vicente; Freeman, Simon R M; Lloyd, Simon K W; Zarowski, Andrzej; Offeciers, Erwin; Kameswaran, Mohan; de la Torre Diamante, Daniel Andrés; Bilginer, Burçak; Thomas, Nick; Bento, Ricardo; Sennaroglu, Gonca; Yucel, Esra; Bajin, Munir Demir; Cole, Chelsea; Martinez, Amy; Loggins, Janice; Eisenberg, Laurie S; Wilkinson, Eric P; Bakey, Cheryl A; Carter, Christine L; Herrmann, Barbara S; Waltzman, Susan; Shapiro, William; Svirsky, Mario; Pallares, Norma; Diamante, Gabriela; Heller, Florencia; Palacios, Maria; Diamante, Lic Leticia; Chang, Waitsz; Tong, Michael; Wu, Hao; Batuk, Merve Ozbal; Yarali, Mehmet; Cinar, Betul Cicek; Ozkan, Hilal Burcu; Aslan, Filiz; Hallin, Karin; Rask-Andersen, Helge; Huarte, Alicia; Prieto-Matos, Carlos; Topsakal, Vedat; Hofkens-Van den Brandt, Anouk; Rompaey, Vincent Van; Boudewyns, An; van de Heyning, Paul; Gaertner, Lutz; Shapira, Yisgav; Henkin, Yael; Battelino, Saba; Orzan, Eva; Muzzi, Enrico; Marchi, Raffaella; Free, Rolien; Frijns, Johan H M; Voelker, Courtney; Winter, Margaret; Schrader, Debra; Ganguly, Dianne Hammes; Egra-Dagan, Dana; Diab, Khassan; Dayxes, Nikolai; Nanan, Ashen; Koji, Robinson; Karaosmanoğlu, Ayça; Bulut, Elif Günay; Verbist, Berit; Azadpour, Mahan; Mandala, Marco; Goffi, Maria Valeria; Polak, Marek; Lee, Kathy Y S; Wilson, Katherine; Friedmann, David R; Rajeswaran, Ranjith; Monsanto, Rafael; Cureoglu, Sebahattin; Driver, Sandra; Bošnjak, Roman; Dundar, Gorkem; Eroglu, Ergin
OBJECTIVES/UNASSIGNED:This study aims to synthesize current knowledge and outcomes related to pediatric auditory brainstem implantation (ABI) in children with severe inner ear malformations (IEMs). It highlights the clinical management practices, challenges, and potential future directions for consensus development in this field. METHODS/UNASSIGNED:A systematic review of findings presented at the Third International Pediatric ABI Symposium organized by the Hacettepe Cochlear Implant team between 3 and 5 September 2020 was conducted, incorporating data from 41 departments across 19 countries. Relevant clinical outcomes, imaging techniques, surgical approaches, and rehabilitation strategies were analyzed to identify key trends and variability in practices. RESULTS/UNASSIGNED:The review indicates that children receiving ABIs exhibit diverse auditory outcomes influenced by individual anatomical variations and developmental factors. Early implantation, particularly before the age of three, positively correlates with better auditory and language development. Multicenter experiences underscore the necessity of tailored decision-making, which considers both surgical candidacy and comprehensive rehabilitation resources. DISCUSSION:/UNASSIGNED:The variability in outcomes emphasizes the need for improved consensus and guidelines regarding eligibility, surgical techniques, and multidisciplinary rehabilitation approaches. Notable complications and the necessity for thorough imaging assessments were also identified as critical components affecting clinical decisions. CONCLUSION/UNASSIGNED:A formal consensus statement is warranted to standardize best practices in ABI management. This will not only enhance patient outcomes but also guide future research efforts to address the remaining challenges in the treatment of children with severe IEMs. Enhanced collaboration among team members will be pivotal in achieving these objectives.
PMID: 39607757
ISSN: 1754-7628
CID: 5766122
Valid Acoustic Models of Cochlear Implants: One Size Does Not Fit All
Svirsky, Mario A; Capach, Nicole Hope; Neukam, Jonathan D; Azadpour, Mahan; Sagi, Elad; Hight, Ariel Edward; Glassman, E Katelyn; Lavender, Annette; Seward, Keena P; Miller, Margaret K; Ding, Nai; Tan, Chin-Tuan; Fitzgerald, Matthew B
HYPOTHESIS/OBJECTIVE:This study tests the hypothesis that it is possible to find tone or noise vocoders that sound similar and result in similar speech perception scores to a cochlear implant (CI). This would validate the use of such vocoders as acoustic models of CIs. We further hypothesize that those valid acoustic models will require a personalized amount of frequency mismatch between input filters and output tones or noise bands. BACKGROUND:Noise or tone vocoders have been used as acoustic models of CIs in hundreds of publications but have never been convincingly validated. METHODS:Acoustic models were evaluated by single-sided deaf CI users who compared what they heard with the CI in one ear to what they heard with the acoustic model in the other ear. We evaluated frequency-matched models (both all-channel and 6-channel models, both tone and noise vocoders) as well as self-selected models that included an individualized level of frequency mismatch. RESULTS:Self-selected acoustic models resulted in similar levels of speech perception and similar perceptual quality as the CI. These models also matched the CI in terms of perceived intelligibility, harshness, and pleasantness. CONCLUSION/CONCLUSIONS:Valid acoustic models of CIs exist, but they are different from the models most widely used in the literature. Individual amounts of frequency mismatch may be required to optimize the validity of the model. This may be related to the basalward frequency mismatch experienced by postlingually deaf patients after cochlear implantation.
PMID: 34766938
ISSN: 1537-4505
CID: 5050812
Reducing interaural tonotopic mismatch preserves binaural unmasking in cochlear implant simulations of single-sided deafness
Sagi, Elad; Azadpour, Mahan; Neukam, Jonathan; Capach, Nicole Hope; Svirsky, Mario A
Binaural unmasking, a key feature of normal binaural hearing, can refer to the improved intelligibility of masked speech by adding masking that facilitates perceived separation of target and masker. A question relevant for cochlear implant users with single-sided deafness (SSD-CI) is whether binaural unmasking can still be achieved if the additional masking is spectrally degraded and shifted. CIs restore some aspects of binaural hearing to these listeners, although binaural unmasking remains limited. Notably, these listeners may experience a mismatch between the frequency information perceived through the CI and that perceived by their normal hearing ear. Employing acoustic simulations of SSD-CI with normal hearing listeners, the present study confirms a previous simulation study that binaural unmasking is severely limited when interaural frequency mismatch between the input frequency range and simulated place of stimulation exceeds 1-2 mm. The present study also shows that binaural unmasking is largely retained when the input frequency range is adjusted to match simulated place of stimulation, even at the expense of removing low-frequency information. This result bears implications for the mechanisms driving the type of binaural unmasking of the present study and for mapping the frequency range of the CI speech processor in SSD-CI users.
PMID: 34717490
ISSN: 1520-8524
CID: 5037682
Assessing temporal responsiveness of primary stimulated neurons in auditory brainstem and cochlear implant users
Azadpour, Mahan; Shapiro, William H; Roland, J Thomas; Svirsky, Mario A
The reasons why clinical outcomes with auditory brainstem implants (ABIs) are generally poorer than with cochlear implants (CIs) are still somewhat elusive. Prior work has focused on differences in processing of spectral information due to possibly poorer tonotopic representation and higher channel interaction with ABIs than with CIs. In contrast, this study examines the hypothesis that a potential contributing reason for poor speech perception in ABI users may be the relative lack of temporal responsiveness of the primary neurons that are stimulated by the ABI. The cochlear nucleus, the site of ABI stimulation, consists of different neuron types, most of which have much more complex responses than the auditory nerve neurons stimulated by a CI. Temporal responsiveness of primary stimulated neurons was assessed in a group of ABI and CI users by measuring recovery of electrically evoked compound action potentials (ECAPs) from single-pulse forward masking. Slower ECAP recovery tended to be associated with poorer hearing outcomes in both groups. ABI subjects with the longest recovery time had no speech understanding or even no hearing sensation with their ABI device; speech perception for the one CI outlier with long ECAP recovery time was well below average. To the extent that ECAP recovery measures reveal temporal properties of the primary neurons that receive direct stimulation form neural prosthesis devices, they may provide a physiological underpinning for clinical outcomes of auditory implants. ECAP recovery measures may be used to determine which portions of the cochlear nucleus to stimulate, and possibly allow us to enhance the stimulation paradigms.
PMID: 33434815
ISSN: 1878-5891
CID: 4746742
Effect of Pulse Rate on Loudness Discrimination in Cochlear Implant Users
Azadpour, Mahan; McKay, Colette M; Svirsky, Mario A
Stimulation pulse rate affects current amplitude discrimination by cochlear implant (CI) users, indicated by the evidence that the JND (just noticeable difference) in current amplitude delivered by a CI electrode becomes larger at higher pulse rates. However, it is not clearly understood whether pulse rate would affect discrimination of speech intensities presented acoustically to CI processors, or what the size of this effect might be. Intensity discrimination depends on two factors: the growth of loudness with increasing sound intensity and the loudness JND (or the just noticeable loudness increment). This study evaluated the hypothesis that stimulation pulse rate affects loudness JND. This was done by measuring current amplitude JNDs in an experiment design based on signal detection theory according to which loudness discrimination is related to internal noise (which is manifested by variability in loudness percept in response to repetitions of the same physical stimulus). Current amplitude JNDs were measured for equally loud pulse trains of 500 and 3000Â pps (pulses per second) by increasing the current amplitude of the target pulse train until it was perceived just louder than a same-rate or different-rate reference pulse train. The JND measures were obtained at two presentation levels. At the louder level, the current amplitude JNDs were affected by the rate of the reference pulse train in a way that was consistent with greater noise or variability in loudness perception for the higher pulse rate. The results suggest that increasing pulse rate from 500 to 3000Â pps can increase loudness JND by 60Â % at the upper portion of the dynamic range. This is equivalent to a 38Â % reduction in the number of discriminable steps for acoustic and speech intensities.
PMCID:5962473
PMID: 29532190
ISSN: 1438-7573
CID: 2992622
A Smartphone Application for Customized Frequency Table Selection in Cochlear Implants
Jethanamest, Daniel; Azadpour, Mahan; Zeman, Annette M; Sagi, Elad; Svirsky, Mario A
HYPOTHESIS: A novel smartphone-based software application can facilitate self-selection of frequency allocation tables (FAT) in postlingually deaf cochlear implant (CI) users. BACKGROUND: CIs use FATs to represent the tonotopic organization of a normal cochlea. Current CI fitting methods typically use a standard FAT for all patients regardless of individual differences in cochlear size and electrode location. In postlingually deaf patients, different amounts of mismatch can result between the frequency-place function they experienced when they had normal hearing and the frequency-place function that results from the standard FAT. For some CI users, an alternative FAT may enhance sound quality or speech perception. Currently, no widely available tools exist to aid real-time selection of different FATs. This study aims to develop a new smartphone tool for this purpose and to evaluate speech perception and sound quality measures in a pilot study of CI subjects using this application. METHODS: A smartphone application for a widely available mobile platform (iOS) was developed to serve as a preprocessor of auditory input to a clinical CI speech processor and enable interactive real-time selection of FATs. The application's output was validated by measuring electrodograms for various inputs. A pilot study was conducted in six CI subjects. Speech perception was evaluated using word recognition tests. RESULTS: All subjects successfully used the portable application with their clinical speech processors to experience different FATs while listening to running speech. The users were all able to select one table that they judged provided the best sound quality. All subjects chose a FAT different from the standard FAT in their everyday clinical processor. Using the smartphone application, the mean consonant-nucleus-consonant score with the default FAT selection was 28.5% (SD 16.8) and 29.5% (SD 16.4) when using a self-selected FAT. CONCLUSION: A portable smartphone application enables CI users to self-select frequency allocation tables in real time. Even though the self-selected FATs that were deemed to have better sound quality were only tested acutely (i.e., without long-term experience with them), speech perception scores were not inferior to those obtained with the clinical FATs. This software application may be a valuable tool for improving future methods of CI fitting.
PMCID:5556943
PMID: 28806335
ISSN: 1537-4505
CID: 2669212
Enhancing speech envelope by integrating hair-cell adaptation into cochlear implant processing
Azadpour, Mahan; Smith, Robert L
Cochlear implants (CIs) bypass some of the mechanisms that underlie normal neural behavior as occurs in acoustic hearing. One such neural mechanism is short-term adaptation, which has been proposed to have a significant role in speech perception. Acoustically-evoked neural adaptation has been mainly attributed to the depletion of neurotransmitter in the hair-cell to auditory-nerve synapse and is therefore not fully present in CI stimulation. This study evaluated a signal processing method that integrated a physiological model of hair-cell adaptation into CI speech processing. The linear high-pass adaptation process expanded the range of rapid variations of the electrical signal generated by the clinical processing strategy. Speech perception performance with the adaptation-based processing was compared to that of the clinical strategy in seven CI users. While there was large variability across subjects, the new processing improved sentence recognition and consonant identification scores in quiet in all the tested subjects with an average improvement of 8% and 6% respectively. Consonant recognition scores in babble noise were improved at the higher signal-to-noise ratios tested (10 and 6 dB) only. Information transfer analysis of consonant features showed significant improvements for manner and place of articulation features, but not for voicing. Enhancement of within-channel envelope cues was confirmed by consonant recognition results obtained with single-channel strategies that presented the overall amplitude envelope of the signal on a single active electrode. Adaptation-inspired envelope enhancement techniques can potentially improve perception of important speech features by CI users.
PMID: 27697486
ISSN: 1878-5891
CID: 2386102
A proposed mechanism for rapid adaptation to spectrally distorted speech
Azadpour, Mahan; Balaban, Evan
The mechanisms underlying perceptual adaptation to severely spectrally-distorted speech were studied by training participants to comprehend spectrally-rotated speech, which is obtained by inverting the speech spectrum. Spectral-rotation produces severe distortion confined to the spectral domain while preserving temporal trajectories. During five 1-hour training sessions, pairs of participants attempted to extract spoken messages from the spectrally-rotated speech of their training partner. Data on training-induced changes in comprehension of spectrally-rotated sentences and identification/discrimination of spectrally-rotated phonemes were used to evaluate the plausibility of three different classes of underlying perceptual mechanisms: (1) phonemic remapping (the formation of new phonemic categories that specifically incorporate spectrally-rotated acoustic information); (2) experience-dependent generation of a perceptual "inverse-transform" that compensates for spectral-rotation; and (3) changes in cue weighting (the identification of sets of acoustic cues least affected by spectral-rotation, followed by a rapid shift in perceptual emphasis to favour those cues, combined with the recruitment of the same type of "perceptual filling-in" mechanisms used to disambiguate speech-in-noise). Results exclusively support the third mechanism, which is the only one predicting that learning would specifically target temporally-dynamic cues that were transmitting phonetic information most stably in spite of spectral-distortion. No support was found for phonemic remapping or for inverse-transform generation.
PMID: 26233005
ISSN: 1520-8524
CID: 2689882
Electrode Selection and Speech Understanding in Patients With Auditory Brainstem Implants
McKay, Colette M; Azadpour, Mahan; Jayewardene-Aston, Deanne; O'Driscoll, Martin; El-Deredy, Wael
OBJECTIVES: The objective of this study was to evaluate whether speech understanding in auditory brainstem implant (ABI) users who have a tumor pathology could be improved by the selection of a subset of electrodes that were appropriately pitch ranked and distinguishable. It was hypothesized that disordered pitch or spectral percepts and channel interactions may contribute significantly to the poor outcomes in most ABI users. DESIGN: A single-subject design was used with five participants. Pitch ranking information for all electrodes in the patients' clinic maps was obtained using a pitch ranking task and previous pitch ranking information from clinic sessions. A multidimensional scaling task was used to evaluate the stimulus space evoked by stimuli on the same set of electrodes. From this information, a subset of four to six electrodes was chosen and a new map was created, using just this subset, that the subjects took home for 1 month's experience. Closed-set consonant and vowel perception and sentences in quiet were tested at three sessions: with the clinic map before the test map was given, after 1 month with the test map, and after an additional 2 weeks with their clinic map. RESULTS: The results of the pitch ranking and multidimensional scaling procedures confirmed that the ABI users did not have a well-ordered set of percepts related to electrode position, thus supporting the proposal that difficulty in processing of spectral information may contribute to poor speech understanding. However, none of the subjects benefited from a map that reduced the stimulation electrode set to a smaller number of electrodes that were well ordered in place pitch. CONCLUSIONS: Although poor spectral processing may contribute to poor understanding in ABI users, it is not likely to be the sole contributor to poor outcomes.
PMID: 25668392
ISSN: 1538-4667
CID: 2689892