Keywords: Speech-To-Text, Text-To-Speech Converter (Both Side), Contacts Selection with Numeric and Alphabets

III. OVERVIEW OF SYSTEM

For visually impaired people it is not easy to handle a particular speech icon, so it is a challenge to implement this kind of application in a way that is accessible to them. The technologies and algorithms used for this application include: HMM (Hidden Markov Model), MFCC (Mel Frequency Cepstrum Coefficient), Android, the forward algorithm, the SMS manager class, JavaScript, an N-gram database and Artificial Intelligence. The process by which a system understands speech and turns it into text is popularly called speech recognition (SR). The different types of speech are as follows:

Connected Words: Separate utterances spoken together with a minimum pause are the input requirement of this system.
Continuous Speech: The speaker dictates to the computer; this is the most difficult kind of recognizer to create.
Spontaneous Speech: The speaker's natural speech acts as the input for the system. It needs a careful speaker; otherwise it generates excessive errors.

A. Existing System

Speech recognition brings a tremendous change to classic keyboard input, making the manipulation of text easier than with the classic method. This application uses the Google API, which uses the hidden Markov model (HMM) method. HMM-based recognition is used for the messages sent to receivers: the speech is recorded, the user selects contacts from their contact list, and an SMS is then sent to the specific person. The first software, developed in 1994, was dictation software based on discrete speech. Discrete speech works slowly and is not a natural means of communication, because it needs a pause after every spoken word. The second speech-based software was developed by IBM and was based on continuous speech. The continuous-speech system was very flexible and allowed natural conversation, but it was too expensive and needed costly PCs.

B. Proposed System

The application we are going to build will use SR with a Google server that uses the HMM model. A detailed description of the working of this system is as follows. Initially, speech is taken as input and recorded by a microphone. When the user speaks, the sound fluctuates in the form of signals, and the fluctuation of the signal depends on the quality of the user's voice. The input speech is divided into sets of words, which fall into different sets of frames. First the input is captured as a fluctuating set of signals, which are recorded. These words are then processed by the system and converted into the desired text. Speech is recognized using components such as feature extraction, an acoustic model, a dictionary, a speech recognition algorithm and a language model. The input speech is first converted into digital signals and then divided into small intervals. These digital signals are then processed by an algorithm.

... increases at a high level. Speech is recorded by a recorder. After the recording is done, the speech is divided into a set of frames or words, and every word and phrase is handled independently. Additional sounds that come with the speech are filtered by the MFCC model so that the system can understand the speech easily. Background voices and low-quality voice should all be filtered out in order to convert the speech into the desired text. An algorithm then converts the speech to text at the sender's side. The converted text is sent to the receivers.

IV. SYSTEM ARCHITECTURE

A. Working of the System

When the application starts, it asks the user for a contact number. The user can provide the contact number in two ways: manually or by voice. Below the contact field there is a message field for the message to be sent to the receiver. The sender speaks a message, which is converted into text at the sender's side.

The message input is converted from speech to text and vice versa by the message converter. HMM is the most successful and most flexible approach to speech recognition, and it is used here to send SMS.

An HMM treats its states independently in the Markov sense: given the current state, the future state does not depend on the past states. The HMM method is basically used for speech recognition: it converts speech into text. It is a flexible and efficient method when used properly. Keeping the states conditionally independent in this way lets the HMM model the desired pattern. MFCC is used for feature extraction. It filters the speech so that it is understandable by the system.
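The proposed system digitizes the input speech and then divides it into small intervals (frames) before recognition. That framing step can be sketched as follows. This is a minimal illustration, not the application's actual code: the function names `split_into_frames` and `frame_energy`, the 25 ms frame length and the 10 ms hop are assumptions chosen for the example, since the paper does not state these values. A synthetic tone stands in for recorded speech.

```python
import math


def split_into_frames(samples, sample_rate, frame_ms=25, hop_ms=10):
    """Split a list of audio samples into overlapping fixed-length frames.

    frame_ms and hop_ms are illustrative defaults, not values from the paper.
    """
    frame_len = int(sample_rate * frame_ms / 1000)
    hop_len = int(sample_rate * hop_ms / 1000)
    if frame_len <= 0 or hop_len <= 0:
        raise ValueError("frame and hop must cover at least one sample")
    frames = []
    start = 0
    # Keep only complete frames; a trailing partial frame is dropped.
    while start + frame_len <= len(samples):
        frames.append(samples[start:start + frame_len])
        start += hop_len
    return frames


def frame_energy(frame):
    """Mean squared amplitude, a crude loudness measure for one frame."""
    return sum(s * s for s in frame) / len(frame)


# One second of a synthetic 440 Hz tone sampled at 8 kHz (hypothetical input).
sample_rate = 8000
signal = [math.sin(2 * math.pi * 440 * n / sample_rate) for n in range(sample_rate)]
frames = split_into_frames(signal, sample_rate)
energies = [frame_energy(f) for f in frames]
```

Overlapping frames are a common choice because a word boundary that falls at the edge of one frame still appears whole in a neighbouring frame. Per-frame energy is one simple cue that could be used to separate speech from near-silent background, although the paper itself assigns filtering to MFCC.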
2) Hidden Markov Model elements

HMM elements can be categorized as follows:
1. Number of states, $N$.
2. Number of distinct observation symbols per state, $M$, with $V = \{V_1, V_2, \ldots, V_M\}$.
3. State transition probability, $a_{ij} = P[q_{t+1} = S_j \mid q_t = S_i]$, $1 \le i, j \le N$.
4. Observation symbol probability distribution in each state $j$, $B_j(k) = P[V_k \text{ at } t \mid q_t = S_j]$.

Compact: It is not compact; it requires many states as the vocabulary increases.

General: This HMM cannot form new words.

HMMs have a very good computational structure. The model is used in a wide range of applications, and it works efficiently if used properly. It can perform complex calculations, is very rich in mathematical structure, and hence can form the theoretical basis for use in a wide range of applications.
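The HMM elements in this section are exactly what the forward algorithm, named in the overview, needs in order to compute the probability of an observation sequence under the model. The sketch below is illustrative only. The two-state, two-symbol model and all its probabilities are invented for the example, and the initial state distribution `pi` is an additional element that the list in this section does not mention but that the computation requires.

```python
def forward(observations, pi, A, B):
    """Return P(observations | model) using the forward algorithm.

    pi[i]   : probability of starting in state i (assumed; not listed in the text)
    A[i][j] : probability of moving from state i to state j
    B[j][k] : probability of emitting symbol k while in state j
    observations : non-empty list of symbol indices in range(M)
    """
    n_states = len(pi)
    # Initialisation: alpha[i] = pi[i] * B[i][first observation]
    alpha = [pi[i] * B[i][observations[0]] for i in range(n_states)]
    # Induction: fold in one observation at a time.
    for obs in observations[1:]:
        alpha = [
            sum(alpha[i] * A[i][j] for i in range(n_states)) * B[j][obs]
            for j in range(n_states)
        ]
    # Termination: sum over all possible final states.
    return sum(alpha)


# Hypothetical model with N = 2 states and M = 2 observation symbols.
pi = [0.6, 0.4]
A = [[0.7, 0.3],
     [0.4, 0.6]]
B = [[0.9, 0.1],
     [0.2, 0.8]]

likelihood = forward([0, 1, 0], pi, A, B)
```

In a recognizer, one such likelihood would typically be computed per candidate word model, and the candidate with the highest value would be chosen. Real systems work with log probabilities to avoid underflow on long sequences, which this sketch omits for brevity.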
MFCC is a method used in speech recognition for feature extraction. Before MFCC, LPC was available, but it has many drawbacks, which MFCC overcomes. MFCC works in the frequency domain, which is more accurate than the time domain. MFCC can be derived from the FFT.
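The frequency-domain step behind MFCC relies on the mel scale, which approximates perceived pitch by spacing filters densely at low frequencies and sparsely at high ones. The sketch below shows only that piece, not a full MFCC pipeline. It uses the commonly cited conversion mel = 2595 log10(1 + f/700); the function names, the filter count of 10 and the 0 to 4000 Hz range are assumptions made for illustration and do not come from the paper.

```python
import math


def hz_to_mel(freq_hz):
    """Convert a frequency in Hz to mels using 2595 * log10(1 + f / 700)."""
    return 2595.0 * math.log10(1.0 + freq_hz / 700.0)


def mel_to_hz(mel):
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def mel_center_frequencies(n_filters, low_hz, high_hz):
    """Return n_filters center frequencies (Hz) equally spaced on the mel scale."""
    low_mel = hz_to_mel(low_hz)
    high_mel = hz_to_mel(high_hz)
    step = (high_mel - low_mel) / (n_filters + 1)
    return [mel_to_hz(low_mel + step * (i + 1)) for i in range(n_filters)]


# Ten hypothetical filter centers covering 0 to 4000 Hz.
centers = mel_center_frequencies(10, 0.0, 4000.0)
```

In a complete MFCC computation, triangular filters placed at these centers would be applied to the FFT power spectrum of each frame, followed by a logarithm and a discrete cosine transform.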
F. Technologies, Methods and Algorithms:

HMM [Hidden Markov Model]:

The most successful and most flexible approach to speech recognition. It is used to send SMS. A feature of HMM is the Markov property: given the current state, the future state is independent of the past states.

MFCC [Mel Frequency Cepstrum Coefficient]:

It extracts features and also selects the parametric representation.

N-gram Database:

It is a complete, open and free platform. It is used to produce an efficient output sequence.

Viterbi Algorithm:

To find the most likely state sequence for the observations.

Baum Welch Algorithm:

To estimate the model parameters.

AI (Artificial Intelligence):

It is used to check the validity of the input speech for the phone.

SMS manager class:

It is provided by Android to handle the default SMS activity.

JavaScript:

We use JavaScript for the recording panel.

JavaApplet:

A pure Java applet is used for the record button.

Eclipse Workbench:

It is used for text reconsecration.

G. Advantage

It uses special technologies, so it must be very fast and almost 100% correct to be understandable.
It uses the SoundX algorithm, which makes use of NLP; SoundX selects the best possible matching words.
The user can select multiple contacts of the same person to reduce multiple reduction.
It recognizes speech with more than 90% accuracy, and the recognition delay is less than 100 ns.
It gives voice guidance for the direction and destination of movement.
It gives alarm services and calling services; the phone number can be selected manually or by voice.
A timer for unread messages, notifications and alerts is provided: when a new message arrives, the timer reminds the user after a time period to read the unread message.
The user can monitor their voice signal level with a red signal bar.

This application is only based on the English language.

We are extremely thankful to our guide, Prof. ANSARI MUKHTAR AMIR, for their valuable guidance and for providing all the necessary facilities, which were indispensable in the completion of this project report. We are also thankful to the Department of Computers of Anjuman-i-islam Kalsekar Campus, New Panvel, for their valuable time, support, comments, suggestions and persuasion, and for the required facilities, Internet access and important books.