The K-mean Clustering Analysis

.docx

School

Concordia University Portland *

*We aren’t endorsed by this school

Course

543

Subject

Computer Science

Date

Jun 24, 2024

Type

docx

Pages

8

Uploaded by chanduRapolu123

Report
Select a dataset from UCI Machine Learning Repository https://archive.ics.uci.edu/ml/datasets.php I’ve selected this Dataset https://archive.ics.uci.edu/dataset/53/iris Describe the dataset you select. I've chosen this dataset because it's a classic example used in statistical and machine learning studies. The Iris dataset includes three different species of iris flowers, with 50 samples for each species. Each flower has measurements for sepal length, sepal width, petal length, and petal width. I can analyze the relationships between these measurements to understand how they differ between species. I can also predict the species of an iris flower based on its measurements. This dataset is perfect for practicing classification techniques and exploring how different features contribute to identifying the species of iris flowers. Perform a K-Means cluster analysis on the variables of the dataset you select. I’m unable to preform k mean cluster so I’ve executed in R console After performing K-Means clustering on the Iris dataset, I got these values. The dataset includes measurements of sepal length, sepal width, petal length, and petal width for three iris species. The K-Means clustering grouped the flowers into three clusters with sizes 62, 38, and 50. The explained proportion of variance by the clusters is 88.43%, and the average silhouette score is 0.5528, indicating a reasonable clustering quality. The cluster centers are as follows: Cluster 1 has a sepal length of 5.90, sepal width of 2.75, petal length of 4.39, and petal width of 1.43. Cluster 2 has a sepal length of 6.85, sepal width of 3.07, petal length of 5.74, and petal width of 2.07. Cluster 3 has a sepal length of 5.01, sepal width of 3.43, petal length of 1.46, and petal width of 0.25. The performance metrics show a maximum diameter of 2.68, minimum separation of 0.26, and a Dunn Index of 0.099. I think this analysis shows that K-Means clustering can effectively group iris flowers based on their physical measurements, with reasonably distinct clusters.
Set up and run the model in JASP by selecting “Number of Clusters” values Similarly, like above I’ve executed R code and I tried visualizing the data
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
  • Access to all documents
  • Unlimited textbook solutions
  • 24/7 expert homework help