Thread: speech identification

  1. #1
    Registered User subdene's Avatar
    Join Date
    Jan 2002

    speech identification

    Hi all, I know this is quite a specialised topic, but I was wondering whether any members who have worked in the speech field before might have a bit of advice for me. I am writing a speaker identification system in C++. The feature I am using from the voiced data is the glottal period, which is unique to each speaker. The feature extraction module works great; however, I think I need some sort of stochastic model to produce a probability that the claimed speaker x really is speaker x, based on their live sample compared against all the others in the database. I was wondering whether I should use Hidden Markov Models for this, or distance templates. Each template produced by the extraction module contains the length of each glottal cycle in ms, and the time at which it occurred during enrolment. Any help on this would be appreciated… thanks. BTW, the system will be operating in text-dependent mode.
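    For the distance-template option, here is a minimal sketch of one way it could look. It is only an illustration under stated assumptions, not an implementation anyone in this thread has confirmed. It assumes each template is reduced to the sequence of glottal cycle lengths in ms (the occurrence times are ignored), uses dynamic time warping (DTW) to align the live sample with each enrolled template, and turns the distances into relative scores with a softmax. The names Template, dtwDistance, scoreSpeakers and the scale parameter are all hypothetical.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <limits>
#include <string>
#include <utility>
#include <vector>

// Assumption: a template is the time-ordered list of glottal cycle lengths in ms.
using Template = std::vector<double>;

// Length-normalised DTW distance between two glottal-period sequences.
double dtwDistance(const Template& a, const Template& b) {
    const double inf = std::numeric_limits<double>::infinity();
    if (a.empty() || b.empty()) return inf;
    // cost[i][j] = cheapest alignment of the first i items of a with the first j of b.
    std::vector<std::vector<double>> cost(
        a.size() + 1, std::vector<double>(b.size() + 1, inf));
    cost[0][0] = 0.0;
    for (std::size_t i = 1; i <= a.size(); ++i) {
        for (std::size_t j = 1; j <= b.size(); ++j) {
            const double d = std::fabs(a[i - 1] - b[j - 1]);
            cost[i][j] = d + std::min({cost[i - 1][j], cost[i][j - 1],
                                       cost[i - 1][j - 1]});
        }
    }
    return cost[a.size()][b.size()] / static_cast<double>(a.size() + b.size());
}

struct Score {
    std::string speaker;
    double distance;     // DTW distance to the live sample
    double probability;  // softmax score relative to the other enrolled speakers
};

// Scores the live sample against every enrolled template, best match first.
// 'scale' (hypothetical tuning knob) controls how sharply distance differences
// are turned into score differences.
std::vector<Score> scoreSpeakers(
    const Template& live,
    const std::vector<std::pair<std::string, Template>>& database,
    double scale = 1.0) {
    assert(scale > 0.0);
    std::vector<Score> scores;
    double best = std::numeric_limits<double>::infinity();
    for (const auto& entry : database) {
        const double d = dtwDistance(live, entry.second);
        scores.push_back({entry.first, d, 0.0});
        best = std::min(best, d);
    }
    if (std::isinf(best)) return scores;  // nothing comparable: scores stay 0
    double total = 0.0;
    for (auto& s : scores) {
        // Subtracting the best distance keeps the exponentials well-behaved.
        s.probability = std::exp(-(s.distance - best) / scale);
        total += s.probability;
    }
    for (auto& s : scores) s.probability /= total;
    std::sort(scores.begin(), scores.end(),
              [](const Score& x, const Score& y) { return x.distance < y.distance; });
    return scores;
}
```

    Note that the softmax score is only relative to the speakers already enrolled: an impostor would still get a "best" match, so a verification step would also need a threshold on the raw distance, tuned on real enrolment data.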
    Be a leader and not a follower.

  2. #2
    Registered User
    Join Date
    Sep 2001
    Doubleanti is quite into that kind of thing. He posted a similar thread a while ago, and all the speech-nerds (no offence - we're all nerds ) discussed it for a while. Do a board search. It was within the last couple of weeks, so it should be easy to find - it was in GD forum.

  3. #3
    former member Brain Cell's Avatar
    Join Date
    Feb 2004
    (no offence - we're all nerds )
    nerd also nurd (nūrd)
    n. Slang
    1. A foolish, inept, or unattractive person.
    2. A person who is single-minded or accomplished in scientific or technical pursuits but is felt to be socially inept
    source :

    Not "all" of us are nerds
    My Tutorials :
    - Bad programming practices in : C
    - C\C++ Tips
    (constructive criticism is very welcome)

    - Brain Cell

  4. #4
    Bob Dole for '08 B0bDole's Avatar
    Join Date
    Sep 2004
    >unattractive person.

    That makes 95% of the people who visit these boards nerds... face the facts.

    I myself am very attractive. But I'm socially

  5. #5
    Registered User subdene's Avatar
    Join Date
    Jan 2002
    All right, thanks.

  6. #6
    Rad gcn_zelda's Avatar
    Join Date
    Mar 2003
    I'm not socially inept or foolish, but I'm not sure how attractive I am :P

  7. #7
    Registered User
    Join Date
    Mar 2003
    I am not familiar with any of the terms you used, so what I'm going to say may come across as stupid.

    I personally haven't implemented this yet, but I sat down and talked with a grad student who has implemented voice recognition software. They used a neural network model. To train the neural network, you speak into a microphone and say a couple of words. A bunch of calculations are done, and then afterwards when you speak into the microphone it can recognize a key set of words. It worked 80% of the time.

    I don't think an elegant solution exists outside of using neural networks (or some type of 'fuzzy logic' with thresholds, which is basically just a special type of neural network anyway).
    See you in 13
