Useful links
A humble selection of audio/speech technology related links. Far from being complete, and will hopefully grow over time.
- MIR PhD list: list of PhD theses and doctoral dissertations related to Music Information Retrieval.
- MIR tools: slightly out of date list of MIR tools.
- Latest papers on speaker diarization: the Special Section on New Frontiers in Rich Transcription
- LIUM Speaker Diarization Wiki:LIUM_SpkDiarization is a software dedicated to speaker diarization (ie speaker segmentation and clustering). It is written in Java, and includes the most recent developments in the domain
- Auditory toolbox:this toolbox will be useful to researchers that are interested in how the auditory periphery works and want to compare and test their thesis
- http://videolectures.net/:exchange ideas and share knowledge
- SAFE:a toolkit using a statistical algorithm for F0 estimation
- Machine Leaning Toolbox:This toolbox provides a number of essential functions for machine learning, especially for data clustering and pattern recognition
- VOICEBOX:Speech Processing Toolbox for MATLAB.
Research Groups & Institutes
(By no means complete, please contact me if you'd like to have your group/institute added to the list).
Europe
- Audio Research Group (ARG), Tampere University of Technology, Tampere, Finland
- Multimedia and Geometry group (MG), Universiteit Utrecht, The Netherlands
- Human Media Interaction (HMI) , (EEMCS) at theUniversity of Twente.
- Metiss: speech and audio data modelling and processing, IRISA,France
- Centre for Digital Music (C4DM), Queen Mary, London, UK
North America
- Center for Computer Research in Music and Acoustics (CCRMA), Stanford University, Standford, CA, USA
- Interactive Audio Lab (IAL), Northwestern University, Evanston, IL, USA
- Laboratory for the Recognition and Organization of Speech and Audio (LabROSA), Columbia University, New York, NY, USA
- The Media Lab, Massachusetts Institute of Technology (MIT), Cambridge, MA, USA
- The Johns Hopkins Center for Language and Speech Processing (CLSP) ,USA
- School of Computer Science, Carnegie Mellon,USA
- Signal, Speech and Language Interpretation (SSLI) Lab, University of Washington,USA
- OSU Laboratory for AI Research (LAIR), Department of Computer Science and Engineering (CSE), and Center for Cognitive Science
- Speech Processing and Auditory Perception Laboratory,UCLA,USA