<?xml version="1.0" encoding="UTF-8"?>
<collection xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.loc.gov/MARC21/slim http://www.loc.gov/standards/marcxml/schema/MARC21slim.xsd" xmlns="http://www.loc.gov/MARC21/slim">
 <record>
  <leader>00000ctm a22000004a 4500</leader>
  <controlfield tag="001">UP-99796217610382301</controlfield>
  <controlfield tag="003">Buklod</controlfield>
  <controlfield tag="005">20120628112629.0</controlfield>
  <controlfield tag="006">m    |o  d |      </controlfield>
  <controlfield tag="007">ta</controlfield>
  <controlfield tag="008">120628s        xx     d     r    |||| u|</controlfield>
  <datafield tag="035" ind1=" " ind2=" ">
   <subfield code="a">(iLib)UPD-00187102700</subfield>
  </datafield>
  <datafield tag="040" ind1=" " ind2=" ">
   <subfield code="a">DENGII</subfield>
  </datafield>
  <datafield tag="041" ind1=" " ind2=" ">
   <subfield code="a">eng</subfield>
  </datafield>
  <datafield tag="042" ind1=" " ind2=" ">
   <subfield code="a">DMLUC</subfield>
  </datafield>
  <datafield tag="090" ind1=" " ind2=" ">
   <subfield code="a">LG 995 2011 C65</subfield>
   <subfield code="b">O54</subfield>
  </datafield>
  <datafield tag="100" ind1="1" ind2=" ">
   <subfield code="a">Ong, Darrel Alvin N.</subfield>
  </datafield>
  <datafield tag="245" ind1="1" ind2="0">
   <subfield code="a">Automated content scoring of Filipino essays using concept indexing</subfield>
   <subfield code="c">Darrel Alvin N. Ong.</subfield>
  </datafield>
  <datafield tag="264" ind1=" " ind2="1">
   <subfield code="a">2011.</subfield>
  </datafield>
  <datafield tag="300" ind1=" " ind2=" ">
   <subfield code="a">x, 55 p.</subfield>
   <subfield code="b">ill.</subfield>
  </datafield>
  <datafield tag="502" ind1=" " ind2=" ">
   <subfield code="a">Thesis (M.S. Computer Science)--University of the Philippines, Diliman.</subfield>
  </datafield>
  <datafield tag="520" ind1="3" ind2=" ">
   <subfield code="a">Essay writing is the writing of a short composition presenting the author's personal views on a given topic. It is taught to every student and has become a major part of formal education in the Philippines. It is also an effective method of improving language proficiency, enriching students' vocabulary as they write.    However, evaluating and scoring essays takes a considerable amount of time, which limits how many writing exercises language teachers can provide. Teachers also often experience boredom and fatigue, especially when checking large numbers of essays, resulting in inconsistent scores. These problems motivate the development of Automated Essay Graders (AEGs). A number of AEGs have already been developed for various languages, but none so far for the Filipino language.    This study addresses these problems by developing the first automated content analysis system for the Filipino language. It is a computer application that automatically analyzes and scores the content of Filipino essays. It captures the semantic meaning of each essay and scores it accordingly.    The corpus consisted of essays written by students from a public high school. These essays were checked by different teachers based on content, grammar, and organization. Only the scores for content were considered in this study.    Experiments were conducted to determine the effects of spell checking, stop-word removal, stemming, sub-clustering, and normalized weighting schemes. Different cases were considered to determine the optimum parameters that improve the performance of the system. Results show that applying spell checking and stemming does not yield significant improvement. However, stop-word removal with the raw term frequency normalized weighting scheme improves system accuracy. The system is implemented using Concept Indexing (CI), which is a relatively new dimensionality reduction algorithm in the field of NLP. It is also implemented using Latent Semantic Indexing (LSI), which is the more common algorithm used by most AEG systems for other languages.    Experiments were conducted for each teacher for CI and LSI using the optimum parameters from the previous experiments. Results show that the relatively new algorithm, CI, outperforms and is faster than LSI.    The system was compared to other human checkers to determine whether it can mimic the way teachers score an essay. Results show that one cannot distinguish between the teacher and the computer mimicking the teacher's style of essay grading.    Students can then be given more writing exercises, which will in turn improve their language proficiency. Lastly, Filipinos will be able to learn and better understand the Filipino language.</subfield>
  </datafield>
  <datafield tag="650" ind1=" " ind2="0">
   <subfield code="a">Essays</subfield>
   <subfield code="x">Indexes</subfield>
   <subfield code="x">Databases.</subfield>
  </datafield>
  <datafield tag="650" ind1=" " ind2="0">
   <subfield code="a">Latent semantic indexing.</subfield>
  </datafield>
  <datafield tag="650" ind1=" " ind2="0">
   <subfield code="a">Automatic indexing</subfield>
   <subfield code="x">Software.</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
   <subfield code="a">Automated Essay Grader (AEG).</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
   <subfield code="a">Concept indexing.</subfield>
  </datafield>
  <datafield tag="695" ind1=" " ind2="0">
   <subfield code="a">Filipino essays.</subfield>
  </datafield>
  <datafield tag="905" ind1=" " ind2=" ">
   <subfield code="a">FI</subfield>
  </datafield>
  <datafield tag="905" ind1=" " ind2=" ">
   <subfield code="a">UP</subfield>
  </datafield>
  <datafield tag="852" ind1="1" ind2=" ">
   <subfield code="a">UPD</subfield>
   <subfield code="b">DARCHIVES</subfield>
   <subfield code="h">LG 995 2011  C65</subfield>
   <subfield code="i">O54</subfield>
  </datafield>
  <datafield tag="852" ind1="1" ind2=" ">
   <subfield code="a">UPD</subfield>
   <subfield code="b">DENG-II</subfield>
   <subfield code="h">LG 995 2011  C65</subfield>
   <subfield code="i">O54</subfield>
  </datafield>
  <datafield tag="942" ind1=" " ind2=" ">
   <subfield code="a">Thesis</subfield>
  </datafield>
 </record>
</collection>
