So I have two pieces of audio and I would like to see how similar they are... i.e. determine if a person is saying the same thing in each piece of audio. I've heard from many places that generating a DFT can help, however I'm not 100% certain what the FFT outputs, and how I can use that to help me. Any advice is appreciated.
Thanks.