<?xml version="1.0" encoding="UTF-8"?>
<collection xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.loc.gov/MARC21/slim http://www.loc.gov/standards/marcxml/schema/MARC21slim.xsd" xmlns="http://www.loc.gov/MARC21/slim">
 <record>
  <leader>00000ctm a22000003a 4500</leader>
  <controlfield tag="001">UP-99796217608810423</controlfield>
  <controlfield tag="003">Buklod</controlfield>
  <controlfield tag="005">20090505110425.0</controlfield>
  <controlfield tag="006">m    |o  d |      </controlfield>
  <controlfield tag="007">ta</controlfield>
  <controlfield tag="008">090505s        xx     d     r    |||| u|</controlfield>
  <datafield tag="035" ind1=" " ind2=" ">
   <subfield code="a">(iLib)UPD-00084797013</subfield>
  </datafield>
  <datafield tag="040" ind1=" " ind2=" ">
   <subfield code="a">DENGII</subfield>
  </datafield>
  <datafield tag="041" ind1=" " ind2=" ">
   <subfield code="a">eng</subfield>
  </datafield>
  <datafield tag="090" ind1=" " ind2="0">
   <subfield code="a">LG 993.5 2009 E64</subfield>
   <subfield code="b">A46</subfield>
  </datafield>
  <datafield tag="100" ind1="1" ind2=" ">
   <subfield code="a">Almonte, Reginald M.</subfield>
  </datafield>
  <datafield tag="245" ind1="1" ind2="0">
   <subfield code="a">Incorporating natural language processing techniques in Filipino speech recognition</subfield>
   <subfield code="c">Reginald M. Almonte, Michael Gringo Angelo R. Bayona, Patrick Simon T. Corrales.</subfield>
  </datafield>
  <datafield tag="264" ind1=" " ind2="1">
   <subfield code="a">2009</subfield>
  </datafield>
  <datafield tag="300" ind1=" " ind2=" ">
   <subfield code="a">119 leaves</subfield>
   <subfield code="b">ill.</subfield>
   <subfield code="e">1 computer laser optical disc (4 3/4 in.)</subfield>
  </datafield>
  <datafield tag="502" ind1=" " ind2=" ">
   <subfield code="a">Thesis (B.S., EEE)--University of the Philippines, Diliman.</subfield>
  </datafield>
  <datafield tag="520" ind1=" " ind2=" ">
   <subfield code="a">Previous projects in the UP Digital Signal Processing Laboratory have dealt with developing a system to recognize Filipino speech and convert it to text. Researchers, however, have observed that most of the word sequences generated the system do not resemble the natural form of the language. In this project, natural language processing (NLP) techniques were incorporated in the speech recognition system for continuous speech and observed whether it improved the performance of the system or not. The recognition system developed is composed of four blocks; the feature calculator, the phone-likelihood estimator, the speech decoder, and the natural language processor. The researchers investigated the performance of five different speech features through phoneme-level recognition. It was found out that among those tested, the Mel Frequency Cepstral Coefficients performed best, yielding a recognition rate of 73.57%. These speech features were then used in the feature calculation. Phone-likelihood estimation was done using a trained Multi-Layer Perception (MLP), and the output posterior probabilities were fed to a start-synchronous decoder which looked for the most probable word sequence. Tests in isolated word and continuous speech show that the system achieved a recognition rate of 71.12% and 9.95% respectively.For continuous-speech, NLP techniques on a syntactic level were employed to correct any errors in the output of the speech decoder. A stochastic word-level language model was used in this block. It verified if the generated word sequences are syntactically correct based on the Filipino language rules represented by the n-gram transitional probabilities. Finally, the proponents determined the viability of the implementing recognizer on a chip through system simulations in Simulink.</subfield>
  </datafield>
  <datafield tag="650" ind1=" " ind2="0">
   <subfield code="a">Computational linguistics.</subfield>
  </datafield>
  <datafield tag="650" ind1=" " ind2="0">
   <subfield code="a">Natural language processing (Computer science)</subfield>
  </datafield>
  <datafield tag="650" ind1=" " ind2="0">
   <subfield code="a">Automatic speech recognition.</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
   <subfield code="a">Filipino speech recognition.</subfield>
  </datafield>
  <datafield tag="700" ind1="1" ind2=" ">
   <subfield code="a">Bayona, Michael Gringo Angelo R.</subfield>
  </datafield>
  <datafield tag="700" ind1="1" ind2=" ">
   <subfield code="a">Corrales, Patrick Simon T.</subfield>
  </datafield>
  <datafield tag="905" ind1=" " ind2=" ">
   <subfield code="a">FI</subfield>
  </datafield>
  <datafield tag="905" ind1=" " ind2=" ">
   <subfield code="a">UP</subfield>
  </datafield>
  <datafield tag="852" ind1="0" ind2=" ">
   <subfield code="a">UPD</subfield>
   <subfield code="b">DENG-II</subfield>
   <subfield code="h">LG 993.5 2009 E64</subfield>
   <subfield code="i">A46</subfield>
  </datafield>
  <datafield tag="942" ind1=" " ind2=" ">
   <subfield code="a">Thesis</subfield>
  </datafield>
 </record>
</collection>
