EC347 Speech and Audio Processing
Course Name:
EC347 Speech and Audio Processing
Programme:
Category:
Credits (L-T-P):
Content:
Speech production: human speech production mechanism, acoustic theory of speech production, digital models for speech production. Speech perception: human hearing, auditory psychophysics, just-noticeable difference (JND), pitch perception, auditory masking, models for speech perception. Speech analysis: time- and frequency-domain analysis of speech, speech parameter estimation, linear prediction. Speech compression: quality measures, waveform coding, source coders, speech compression standards for personal communication systems. Audio processing: characteristics of audio signals, sampling, audio compression techniques, standards for audio compression in multimedia applications, MPEG audio encoding and decoding, audio databases and applications. Speech synthesis: text-to-speech synthesis, letter-to-sound rules, syntactic analysis, timing and pitch segmental analysis. Speech recognition: segmental feature extraction, dynamic time warping (DTW), hidden Markov models (HMMs), approaches for speaker, speech and language recognition and verification.