====== Implementations ======

A list of existing implementations that you can use in projects. To find more, talk with people in the field and search Google, newsgroups (comp.ai.*), Google Groups (such as face-rec), and the References sections on this page.

===== Learning =====

  * [[http://sourceforge.net/project/showfiles.php?group_id=177723|FGA]]: Fast Genetic Algorithms
  * [[http://htk.eng.cam.ac.uk/|HTK]]: Hidden Markov Model Toolkit (C)
  * [[http://www.cs.waikato.ac.nz/ml/weka/|Weka]]: machine learning for data mining

==== References ====

  * [[http://mloss.org/software/|Machine Learning Open Source Software]]: forum for open source software in machine learning

===== Perception =====

==== Vision ====

=== Face detection/recognition ===

  * [[http://opencvlibrary.sourceforge.net/|OpenCV library]]: face detection algorithms (C/C++)
  * [[http://www.ktl.elf.stuba.sk/~beszedes/home/index.php?page=software-download|Beszedes]]: face, facial feature, and expression recognition (C/C++); skin detection (Matlab)
  * [[http://www.cs.colostate.edu/evalfacerec/algorithms5.html|CSU Face Identification Evaluation System]]: 4 baseline face recognition algorithms (C), with training data
  * [[http://sourceforge.net/projects/malic/|Malic]]: face recognition
  * [[http://www.inf.ufrgs.br/~gfranciscatto/autofaro/|AutoFaro]], [[http://www.esnips.com/doc/835f5710-ada9-4580-868c-25c5bd6b54d6/Autofaro|AutoFaro]]: face recognition with eigenfaces

=== Misc ===

  * [[http://www.cs.wisc.edu/computer-vision/projects/gsnake.html|GSnake]]
  * [[http://openvidia.sourceforge.net/|OpenVidia]]: GPU-accelerated computer vision library
  * [[http://vxl.sourceforge.net/|VXL]]: Vision-something-Libraries
  * [[http://www.mavrinac.com/projects/imageseg/index.html|Color Image Segmentation]]: using controlled rival-penalized competitive learning in multiple color spaces (C++)
  * [[http://www.ces.clemson.edu/~stb/klt/|KLT Tracker]]: Kanade-Lucas-Tomasi tracker

==== Speech ====

=== Recognition ===

  * [[http://www.fon.hum.uva.nl/praat|Praat Phonetics Toolkit]]
  * [[http://cmusphinx.sourceforge.net/html/cmusphinx.php|CMUSphinx]]: speech recognition engine
  * [[http://htk.eng.cam.ac.uk/|HTK]]: Hidden Markov Model Toolkit
  * [[http://www.nist.gov/speech/tools/|CMUseg]]: acoustic segmentation
  * [[http://www.nist.gov/speech/tools/|tsylb2]]: syllabification software (C)
  * [[http://www.icsi.berkeley.edu/Speech/papers/qio/|Qualcomm-ICSI-OGI]]: speech detection, Wiener filter noise reduction, nonspeech frame dropping
  * [[http://www.icsi.berkeley.edu/Speech/papers/gabor/|Kleinschmidt]]: Gabor feature extraction for automatic speech recognition
  * [[http://cslr.colorado.edu/beginweb/speech_recognition/sonic.html|Sonic]]: large-vocabulary continuous speech recognition system (C++)
  * [[http://nico.nikkostrom.com/|NICO Toolkit]]: artificial neural network (ANN) toolkit designed and optimized for speech technology applications

=== Synthesis ===

  * [[http://mary.dfki.de/|MARY]]: text-to-speech system (emotions)
  * [[http://www.cstr.ed.ac.uk/projects/festival/download.html|Festival]]: framework for building speech synthesis systems

==== Misc ====

  * [[http://www.jjj.de/fxt/|FXT]]: fast transforms and low-level algorithms
  * [[http://www.cstr.ed.ac.uk/projects/speech_tools/|The Edinburgh Speech Tools Library]]: collection of C++ classes, functions, and related programs for manipulating the sorts of objects used in speech processing
  * [[http://sp-tk.sourceforge.net/|SPTK]]: Speech Signal Processing Toolkit
  * [[http://nltk.sourceforge.net/|NLTK]]: Natural Language Toolkit (Python)
  * [[http://mallet.cs.umass.edu/|MALLET]]: advanced machine learning for language
  * [[http://ilk.uvt.nl/timbl/|TiMBL]]: Tilburg Memory-Based Learner

==== Corpus ====

  * [[http://www.voxforge.org/|VoxForge]]: speech corpus
  * [[http://www.isca-students.org/corpora]]

==== References ====

  * [[http://www.isca-students.org/resources]]
  * [[http://www.icsi.berkeley.edu/Speech/papers/gelbart-ms/pointers/]]
  * [[http://mallet.cs.umass.edu/index.php/Similar_software|Machine learning and text]]
  * [[http://homepages.inf.ed.ac.uk/s0450736/maxent.html#soft|Maximum entropy software]]
  * [[http://www.elis.ugent.be/cost249/td95002.htm|Speech Processing Software]]