<?xml version="1.0" encoding="UTF-8"?>
<collection xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.loc.gov/MARC21/slim http://www.loc.gov/standards/marcxml/schema/MARC21slim.xsd" xmlns="http://www.loc.gov/MARC21/slim">
 <record>
  <leader>00000nam a22000004a 4500</leader>
  <controlfield tag="001">UP-1685675941131510167</controlfield>
  <controlfield tag="003">Buklod</controlfield>
  <controlfield tag="005">20240229142243.0</controlfield>
  <controlfield tag="006">m    |o  d |      </controlfield>
  <controlfield tag="007">ta</controlfield>
  <controlfield tag="008">240213s2023    xxu     r    |||| u|eng d</controlfield>
  <datafield tag="040" ind1=" " ind2=" ">
   <subfield code="a">DSTC</subfield>
   <subfield code="e">rda</subfield>
  </datafield>
  <datafield tag="041" ind1=" " ind2=" ">
   <subfield code="a">eng</subfield>
  </datafield>
  <datafield tag="090" ind1=" " ind2=" ">
   <subfield code="a">LG 995 2023 S8</subfield>
   <subfield code="b">A58</subfield>
  </datafield>
  <datafield tag="100" ind1="1" ind2=" ">
   <subfield code="a">Antonio, Czarinne Antoinette A.</subfield>
   <subfield code="e">author</subfield>
  </datafield>
  <datafield tag="245" ind1="1" ind2="0">
   <subfield code="a">Modified silhouette score for evaluating cluster solutions</subfield>
   <subfield code="c">Czarinne Antoinette A. Antonio.</subfield>
  </datafield>
  <datafield tag="264" ind1=" " ind2="1">
   <subfield code="c">c2023</subfield>
   <subfield code="a">Quezon City</subfield>
   <subfield code="b">University of the Philippines</subfield>
  </datafield>
  <datafield tag="300" ind1=" " ind2=" ">
   <subfield code="a">93 p.</subfield>
  </datafield>
  <datafield tag="500" ind1=" " ind2=" ">
   <subfield code="a">Thesis (M.S.) --University of the Philippines, Diliman.</subfield>
  </datafield>
  <datafield tag="520" ind1="0" ind2=" ">
   <subfield code="a">ABSTRACT&#13;
&#13;
The assessment of the quality of a clustering solution and the proper identification of the number of clusters to be used is crucial step in doing cluster analysis. A class of silhouette-based indices, as modification to the widely used silhouette index, is developed to measure cluster validity. The performance of the proposed indices is demonstrated via a simulation study and through the application to actual data sets. The results revealed that the use of the second and third nearest cluster in the computation instead of just the nearest neighboring cluster relative to an observation was advantageous in identifying the number of natural clusters as a viable choice in the cluster analysis. Any of the proposed indices were useful in the presence of noisy data and not well-separated clusters. Further, dimension reduction techniques employed in the calculation of the distance measures provided an added benefit when dealing with high-dimensional data.</subfield>
  </datafield>
  <datafield tag="650" ind1=" " ind2="0">
   <subfield code="a">Cluster analysis.</subfield>
  </datafield>
  <datafield tag="650" ind1=" " ind2="0">
   <subfield code="a">Cluster validation.</subfield>
  </datafield>
  <datafield tag="650" ind1=" " ind2="0">
   <subfield code="a">Silhouette index.</subfield>
  </datafield>
  <datafield tag="905" ind1=" " ind2=" ">
   <subfield code="a">UP</subfield>
  </datafield>
  <datafield tag="950" ind1=" " ind2=" ">
   <subfield code="a">Monograph</subfield>
  </datafield>
  <datafield tag="852" ind1="0" ind2=" ">
   <subfield code="a">UPD</subfield>
   <subfield code="b">DSTC</subfield>
   <subfield code="h">LG 995 2023 S8</subfield>
   <subfield code="i">A58</subfield>
  </datafield>
  <datafield tag="942" ind1=" " ind2=" ">
   <subfield code="a">Book</subfield>
  </datafield>
 </record>
</collection>
