Simple Project List Software Map

132 projects in result set
Última Atualização: 2019-04-21 23:51

Julius

Julius is an open-source, high-performance large vocabulary continuous speech recognition (LVCSR) engine for speech-related researchs and developments. With HMM acoustic model and language model, you can construct your own speech recognition system.

Moved to github: https://github.com/julius-speech/julius

Desenvolvimento Estado: 4 - Beta, 5 - Production/Stable
Destinado Audiência: Developers, End Users/Desktop
Linguagem Natural: English, Japanese
Sistema Operacional: Linux, Windows, OS Independent
Linguagem de Programação: C
Interface de Usuário: Console (Text Based)
Register Date: 2002-09-09 14:38
Última Atualização: 2011-12-26 14:04

linphone

Linphone is an audio and video Internet phone with GTK+ and console interfaces. It uses the SIP protocol, and is compatible with most SIP clients and gateways. It can use various audio and video codecs such as Speex, GSM, G711, G722, ilbc, amr, Theora, H263-1998, MPEG4, H264, VP8, and snow.

Última Atualização: 2018-12-25 17:07

Open JTalk

Open JTalk は、修正BSDライセンスの元で配布されている日本語テキスト音声合成システムです。Open JTalk は、オープンソースの形態素解析エンジンの MeCab(和布蕪、めかぶ)、奈良先端大学を中心にして開発された形態素解析用辞書の naist-jdic、隠れマルコフモデル(HMM)に基づく音声合成エンジン hts_engine を用いています。

Última Atualização: 2008-07-24 11:29

Speex

Speex is a patent-free compression format designed especially for speech. It is specialized for voice communications at low bit-rates in the 2-45 kbps range. Possible applications include Voice over IP (VoIP), Internet audio streaming, audio books, and archiving of speech data (e.g. voice mail).

Última Atualização: 2009-03-25 07:41

FAAC

The FAAC project includes the AAC encoder FAAC and decoder FAAD2. It supports several MPEG-4 object types (LC, Main, LTP, HE AAC, PS) and file formats (ADTS AAC, raw AAC, MP4), multichannel and gapless en/decoding as well as MP4 metadata tags. The codecs are compatible with standard-compliant audio applications using one or more of these profiles.

Última Atualização: 2019-09-28 21:14

sourcesinc

信号と知能の研究室からソースコード http://fich.unl.edu.ar/sinc

(Machine Translation)
Última Atualização: 2018-05-30 16:31

WaveSurfer

WaveSurfer は、サウンドの視覚化と操作のためのオープン ソースのツールです。スピーチ/サウンド解析。アノテーション/転写などの典型的なアプリケーションです。WaveSurfer プラグインによって拡張可能で、他のアプリケーションに埋め込むこともできます。

Última Atualização: 2005-11-14 13:35

PHP Voice

PHP Voice (formerly known as PHP VXML) contain four classes that assist in developing voice application using PHP. It supports Speech Synthesis Markup Language 1.0, Speech Recognition Grammar Specification 1.0, Voice Browser Call Control: CCXML 1.0, and Voice Extensible Markup Language (VoiceXML) 2.0.

Última Atualização: 2007-10-10 13:37

FlowDesigner

FlowDesigner is a data flow-oriented development environment. It can be used to build complex applications by combining small, reusable building blocks. In some ways, it is similar to both Simulink and LabView, but is hardly a clone of either.

(Machine Translation)
Última Atualização: 2008-12-23 17:37

eSpeak

eSpeak is a compact text to speech engine for good
quality English and other languages. Its clear
articulation and good intonation makes it suitable
for listening to long text articles. It can speak
text files from the command line, and also
operates as a "talker" within the KDE TTS system
and with a Gnome Speech driver, as an alternative
to Festival or other similar programs. Windows
SAPI5 and command line versions are also available.

(Machine Translation)
Última Atualização: 2013-03-03 19:13

MisterHouse

MisterHouse is a Unix/Windows home automation program written in Perl. It can respond to voice commands, Web browsers, time of day, serial port and X10 data, external files, etc., and can speak via Text to Speech engines.

Última Atualização: 2004-09-28 09:44

Sphinx-4

Sphinx-4 is a speaker-independent, continuous speech recognition system.

Última Atualização: 2013-11-14 02:07

CMU Sphinx

CMU Sphinx, a Speech Recognition System, is transitioning to Open Source. The distribution contains a library (libsphinx2) and some small examples that link against it.

Última Atualização: 2009-10-16 22:23

Imptalk

Real time communication software built to provide face-to-face advantages to remote gamers.

(Machine Translation)
Database Environment: SQLite
Linguagem Natural: English
Sistema Operacional: MacOSX, Linux, Windows
Linguagem de Programação: C++, Python
Interface de Usuário: wxWidgets
Última Atualização: 2012-11-02 21:41

SpeechLion

SpeechLion is a speech recognition application for
desktop command and control. It is based on the
Sphinx-4 recognizer, and it allows the user to
control the Linux desktop using simple spoken
commands. Some example commands are "browse
google", "mouse click", "next window", "show
help", and "volume mute". SpeechLion recognizes
high-level commands for Web browsing via Firefox,
simple Emacs usage, window control, volume
control, and more. It also has low-level commands
for ad-hoc keyboard shortcuts and mouse actions.

(Machine Translation)