Emotion Recognition in Speech Processing using Fast Fourier Transform
Veerendra Kumar Jammula1, Ashok Reddy Gogireddy2, Hari Kiran Vege3, Kolla Bhanu Prakasha4
1Veerendra Kumar Jammula, Department of Computer Science and Engineering, Koneru Lakshmaiah Education Foundation, Vaddeswaram, Guntur, A.P., India.
2Ashok Reddy gogireddy, Department of Computer Science and Engineering, Koneru Lakshmaiah Education Foundation, Vaddeswaram, Guntur, A.P., India.
3Hari Kiran Vege, Department of Computer Science and Engineering, Koneru Lakshmaiah Education Foundation, Vaddeswaram, Guntur, A.P., India.
4Kolla Bhanu Prakash, Department of Computer Science and Engineering, Koneru Lakshmaiah Education Foundation, Vaddeswaram, Guntur, A.P., India.
Manuscript received on November 11, 2019. | Revised Manuscript received on November 20 2019. | Manuscript published on 30 November, 2019. | PP: 11057-11060 | Volume-8 Issue-4, November 2019. | Retrieval Number: D5435118419/2019©BEIESP | DOI: 10.35940/ijrte.D5435.118419
Open Access | Ethics and Policies | Cite | Mendeley | Indexing and Abstracting
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC-BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
Abstract: The idea of acquiring the state of human emotions from one’s speech, we have gathered required data that makes one to understand the concept behind this process. Human emotions can be predicted by his/her facial expressions or by the tone of their voice. Reading the facial expressions is one of the major tasks involved in image processing. Likewise, each emotion holds different tone in one’s voice. It requires a various emotional tone frequency to calculate and analyse the emotions. We need to fetch approximate frequencies of emotions. It’s the challenging task as each speaker has various pitches while speaking and frequencies of the same person varies in his emotion. Another main issue is the noise in the input while a person is speaking, due to less quality recordings or surrounding environment. List of basic emotions are Happy, angry, sad, bored, surprised, disgust, fear. For this project the prior important concept is speech recognition. The machine must be capable of reading the input in form as speech and must be capable of analysing various contents. The input given is converted into wav format. At the same time machine must be also capable of fetching the frequencies. The calculation is performed using many methodologies that are defined.
Keywords: Emotions, Emotion Recognition, FFT, Frame rate, Frequency.
Scope of the Article: Image Processing and Pattern Recognition.