Thread: Sound file to speech transcript library?

  1. #1
    Banned nickname_changed's Avatar
    Join Date
    Feb 2003
    Location
    Australia
    Posts
    986

    Sound file to speech transcript library?

    I've not given much thought to this before really, but has anyone come across a library or program that can take a sound file (say an MP3) and extract all the speech (a sort of transcription)? I don't care whether it is grammatically correct or anything like that; I just want it to list all of the words used so I can search through them later.

    The reason I'm asking is that when I go to uni I would like to record all of the lectures etc. and store them in a database, so that later in the year I can search through all the recordings for information on a particular subject.

    Any information/resources/links would be great! If I can't find a library (C++, COM, .NET, any) I'll attempt to write something myself.

    Thanks, and if I can throw something together I'll definitely give it to anyone that's interested.

  2. #2
    S Sang-drax's Avatar
    Join Date
    May 2002
    Location
    Göteborg, Sweden
    Posts
    2,072
    I doubt it.
    Some programs offer speech recognition, but they cost money and they need to be trained a lot before they become any good.
    Last edited by Sang-drax : Tomorrow at 02:21 AM. Reason: Time travelling

  3. #3
    Linguistic Engineer... doubleanti's Avatar
    Join Date
    Aug 2001
    Location
    CA
    Posts
    2,459
    >I'll attempt to write something myself.

    DIY! =)

    It's really fun stuff to get into, which is why I'm doing it for research this year. You get to learn your DFTs inside and out (which you need anyway if you're going into EE), and it's really neat. But it requires linguistics! Read, read, read... I'm trying to do it right now; I just have to get past all the mathy DFT stuff first. I'm almost at a nice spectrogram, evenly distributed in frequency.
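For anyone wondering what the "mathy DFT stuff" looks like in practice, here is a minimal sketch of a spectrogram built from a naive DFT. It is only an illustration under stated assumptions: the function names, the frame length of 64, the hop of 32 and the Hann window are all choices made for this example, not anything from a particular speech library, and a real program would use an FFT instead of this O(N^2) loop.

```python
import cmath
import math


def dft_magnitudes(frame):
    """Return |X[k]| for k = 0 .. N/2 of a real-valued frame (naive O(N^2) DFT)."""
    n = len(frame)
    mags = []
    for k in range(n // 2 + 1):
        acc = 0j
        for t, x in enumerate(frame):
            acc += x * cmath.exp(-2j * math.pi * k * t / n)
        mags.append(abs(acc))
    return mags


def spectrogram(samples, frame_len=64, hop=32):
    """Slice the signal into overlapping Hann-windowed frames and DFT each one."""
    # Hann window (assumed choice) to reduce leakage at the frame edges.
    window = [0.5 - 0.5 * math.cos(2 * math.pi * i / frame_len)
              for i in range(frame_len)]
    rows = []
    for start in range(0, len(samples) - frame_len + 1, hop):
        frame = [samples[start + i] * window[i] for i in range(frame_len)]
        rows.append(dft_magnitudes(frame))
    return rows


# Toy input (made up): a pure tone sitting exactly on DFT bin 8 of a 64-sample frame.
tone = [math.sin(2 * math.pi * 8 * t / 64) for t in range(256)]
spec = spectrogram(tone)

# Strongest frequency bin in each time frame.
peak_bins = [row.index(max(row)) for row in spec]
print(peak_bins)
```

Each row of `spec` is one time slice and each column is one evenly spaced frequency bin, which is the grid you would shade to draw the picture. Formant tracking would then look for peaks moving across rows, but that part is not attempted here.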

    Pick up any fundamentals of speech processing book to learn about formants and formant movement, and pick up an intro phonology book to learn about the IPA. It's so fun!

    Anyone doing anything remotely interdisciplinary excites me, especially speech processing!
    hasafraggin shizigishin oppashigger...

  4. #4
    Crazy Fool Perspective's Avatar
    Join Date
    Jan 2003
    Location
    Canada
    Posts
    2,640
    Go to http://sourceforge.net/search/, type in "Speech recognition" and check out the source of the project of your choice.

  5. #5
    Yes, my avatar is stolen anonytmouse's Avatar
    Join Date
    Dec 2002
    Posts
    2,544
    Check out HP Speechbot for what can be done in this field. Also try a search on speech recognition library.

  6. #6
    The Defective GRAPE Lurker's Avatar
    Join Date
    Feb 2003
    Posts
    949
    Do you want the program to recognize only English sounds, or many different sounds? There are quite a few sounds out there, so you may want to try only English first.

    Doubleanti: I taught myself how to pronounce ejectives today. I'm happy happy.... Love that s'!
    Do not make direct eye contact with me.

  7. #7
    Linguistic Engineer... doubleanti's Avatar
    Join Date
    Aug 2001
    Location
    CA
    Posts
    2,459
    Haha, good job~! ::pats you on head::

    I assume you mean non-English ejectives haha, else I'd say of course you knew how to... =)

    Go for a Cambodian injective now, enjoy! haha...
    hasafraggin shizigishin oppashigger...

  8. #8
    Banned nickname_changed's Avatar
    Join Date
    Feb 2003
    Location
    Australia
    Posts
    986
    Thanks for all the links guys, I found a few libraries that may do what I want, but they all seem to be in Java :P

    Lurker: yep, just English sounds and words. I don't even need it to recognise punctuation, if it just outputs something like:
    Code:
    hi john how are you i am well thanks peter today we will be talking 
    to you about the wonderful work being conducted by MIT into 
    the benefits of crop rotation for coconut farmers
    Just so that I have some list of keywords to search on. Then I can list the closest-looking matches.
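The searching half of this is straightforward once the words exist as text. A rough sketch, assuming each recording has already been turned into a flat string of words like the one above: build a word-to-recording index, then rank recordings by how many query-word hits they contain. The names `build_index`, `search`, `lecture_01` and `lecture_02` are made up for the example, and the second transcript is invented sample data.

```python
from collections import Counter, defaultdict


def build_index(transcripts):
    """Map each word to the recordings it occurs in, with occurrence counts."""
    index = defaultdict(Counter)
    for name, text in transcripts.items():
        for word in text.lower().split():
            index[word][name] += 1
    return index


def search(index, query):
    """Rank recordings by the total number of query-word occurrences."""
    scores = Counter()
    for word in query.lower().split():
        # .get() avoids creating empty entries for words never seen.
        for name, count in index.get(word, {}).items():
            scores[name] += count
    return [name for name, _ in scores.most_common()]


# Hypothetical transcripts; the first reuses the sample output above.
transcripts = {
    "lecture_01": (
        "hi john how are you i am well thanks peter today we will be talking "
        "to you about the wonderful work being conducted by mit into "
        "the benefits of crop rotation for coconut farmers"
    ),
    "lecture_02": "today we look at soil chemistry and why crop yield depends on nitrogen",
}

index = build_index(transcripts)
print(search(index, "crop rotation"))
print(search(index, "nitrogen"))
```

Counting raw hits is the simplest possible ranking. Since recognition output will contain misheard words, fuzzy matching or weighting rare words more heavily would probably give better results, but that is left out of this sketch.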

    Will let you know how it goes, thanks for the links!
    Last edited by nickname_changed; 12-17-2004 at 09:57 AM.

  9. #9
    Yes, my avatar is stolen anonytmouse's Avatar
    Join Date
    Dec 2002
    Posts
    2,544
    Sphinx 2 & 3 are coded in C.
