UPDATE : Source code has moved to a git repository, http://github.com/dilawar/sound . The project README file has the updated instructions. Instructions here may be outdated. This post is for demo purposes only.
This work is a derivative of Dr. Taylor’s work http://people.msoe.edu/~taylor/examples/wav.htm . There was a bug in his implementation which is removed from this implementation.
A C++ class to read WAV file and store its content in an array. We also store these values in an ASCII file and plot it using gnuplot
to check whether we get it right. praat
can be used to confirm whether our output is right or not. Google to read about the WAV format. We only support a non-compressed WAV file.
WAV File Format
We do not support the compressed WAV file. The format which is programmed here is available here https://ccrma.stanford.edu/courses/422/projects/WaveFormat/.
Implementation
The Makefile project can be found at github repository http://github.com/dilawar/sound . After compilation (using make all)
, you’ll get the binary parseSpeech
. Use this binary ./parseSpeech -f
. If your wav file is long (more than 3-4 seconds) the txt file generated by this program speechData.txt
will be very lengthy. Use can use gnuplot
to draw the waveform.
Woo, It’s a great job, Thank you1
Original link down. License any more permissive than default copyright?
Looks like GPL based on code comments
If you help me correct the lincensing errors, I’d be grateful. I’ve added GPLv2 to the repo.
I just checked the logs. The original source is also GPLv2 (http://www.ee.calpoly.edu/media/uploads/resources/WAV_Only_SRC_07-2010.zip). On master branch, however, things has changed drastically. I’ll probably modify the repo to maintain the GPLV2.
I am really not sure about the license stuff.