Open Source Text Processing Project: PocketSphinx

PocketSphinx

Project Website: None
Github Link:

Description

PocketSphinx is a lightweight speech recognition engine, specifically tuned for handheld and mobile devices, though it works equally well on the desktop.

This is PocketSphinx, one of Carnegie Mellon University’s open source large vocabulary, speaker-independent continuous speech recognition engines.

THIS IS A RESEARCH SYSTEM. This is also an early release of a research system. We know the APIs and function names are likely to change, and that several tools need to be made available to make this all complete. With your help and contributions, this can progress in response to the needs and patches provided.

Please see the LICENSE file for terms of use.

Prerequisites

You must have SphinxBase, which you can download from http://cmusphinx.sourceforge.net. Download and unpack it to the same parent directory as PocketSphinx, so that the configure script and project files can find it. On Windows, you will need to rename ‘sphinxbase-X.Y’ (where X.Y is the SphinxBase version number) to simply ‘sphinxbase’ for this to work.
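As a quick sanity check of the layout described above, the following sketch looks for a SphinxBase directory next to the PocketSphinx source tree. The function name `find_sphinxbase` is hypothetical and not part of either project; it only inspects directory names and does not verify that the tree is complete or built.

```shell
# Hypothetical helper: report whether a SphinxBase tree sits next to PocketSphinx.
# $1 is the path to the PocketSphinx source directory.
find_sphinxbase() {
    parent=$(cd "$1/.." && pwd) || return 1
    # Preferred layout: a sibling directory named exactly "sphinxbase".
    if [ -d "$parent/sphinxbase" ]; then
        echo "$parent/sphinxbase"
        return 0
    fi
    # Versioned directories (sphinxbase-X.Y) are reported so they can be renamed if needed.
    for candidate in "$parent"/sphinxbase-*; do
        if [ -d "$candidate" ]; then
            echo "$candidate"
            return 0
        fi
    done
    echo "SphinxBase not found in $parent" >&2
    return 1
}
```

Calling `find_sphinxbase .` from inside the PocketSphinx directory prints the sibling directory it found, or an error on standard error if there is none.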

Linux/Unix installation

In a Unix-like environment (such as Linux or Solaris):

Build and optionally install SphinxBase. If you want to use fixed-point arithmetic, you must configure SphinxBase with the --enable-fixed option.
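A minimal sketch of the fixed-point configuration step, assuming the SphinxBase source has been unpacked and already contains a "configure" script. The function name `configure_sphinxbase_fixed` is hypothetical; the only option it passes is the `--enable-fixed` option named above.

```shell
# Hypothetical helper: configure a SphinxBase source tree for fixed-point arithmetic.
# $1 is the path to the unpacked SphinxBase source (an assumption about your layout).
configure_sphinxbase_fixed() {
    src_dir=$1
    if [ ! -x "$src_dir/configure" ]; then
        echo "no executable configure script in $src_dir" >&2
        return 1
    fi
    # Run in a subshell so the caller's working directory is left unchanged.
    (cd "$src_dir" && ./configure --enable-fixed)
}
```

After `configure_sphinxbase_fixed ../sphinxbase` succeeds, SphinxBase would be built with `make` (and optionally installed with `sudo make install`) in that directory.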

If you downloaded directly from the CVS repository, you need to do this at least once to generate the “configure” file:

$ ./autogen.sh

If you downloaded the release version, or ran autogen.sh at least once, then compile and install:

$ ./configure
$ make clean all
$ make check
$ sudo make install
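Once the install step has finished, a quick check that the programs landed on your PATH can save some confusion. The sketch below assumes your release installs a `pocketsphinx_continuous` program, as a number of PocketSphinx releases have; substitute another program name if yours differs. `have_command` is a hypothetical helper, not something shipped with PocketSphinx.

```shell
# Hypothetical helper: succeed if the named command can be found on PATH.
have_command() {
    command -v "$1" >/dev/null 2>&1
}

# Assumption: this release installs a program called pocketsphinx_continuous.
if have_command pocketsphinx_continuous; then
    echo "pocketsphinx_continuous found at $(command -v pocketsphinx_continuous)"
else
    echo "pocketsphinx_continuous not on PATH; check the install prefix" >&2
fi
```

If the program is not found even though the install step succeeded, the install prefix's bin directory may simply be missing from PATH.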

