UH-DMML Group

Master's Thesis Title:  Region Discovery Using Supervised Clustering
Author:  Kim Keen Wee
Thesis Advisor:  Dr. Christoph F. Eick

Datasets

4 Wyoming Datasets created based on Census Data 2000

Visualization of the 4 Wyoming Datasets created (doc)

Steps for Creating Wyoming Population Dataset (ppt)


Programs for Visualization

1) To visualize the dataset file
     display.java   readme.txt
     > java display filename.txt
     Sample input datasets: income_495.txt, Complex1.txt
     Output examples: income_495.GIF, Complex1.GIF

2) To visualize a clustering
     vs2.java   readme.txt
     samples input files:
     > java vs2 age_2962.txt clusters_a.txt
     > java vs2 age_2962.txt clusters_b.txt
     Output examples: clusters_a.GIF, clusters_b.GIF

Limitation: maximum 25 clusters, 4 class labels.


References from US Census Bureau (Census 2000)

Map of Wyoming County (link)
Population Estimate for Counties of Wyoming (pdf)

4 Wyoming datasets created - based on the following Census Data (for class label)
a) Household Income in 1999
http://factfinder.census.gov/servlet/DTTable?_bm=y&-context=dt&-ds_name=DEC_2000_SF3_U&-mt_name=DEC_2000_SF3_U_P052&-CONTEXT=dt&-tree_id=403&-redoLog=true&-all_geo_types=N&-currentselections=DEC_2000_SF3_U_P052&-geo_id=05000US56001&-geo_id=05000US56003&-geo_id=05000US56005&-geo_id=05000US56007&-geo_id=05000US56009&-geo_id=05000US56011&-geo_id=05000US56013&-geo_id=05000US56015&-geo_id=05000US56017&-geo_id=05000US56019&-geo_id=05000US56021&-geo_id=05000US56023&-geo_id=05000US56025&-geo_id=05000US56027&-geo_id=05000US56029&-geo_id=05000US56031&-geo_id=05000US56033&-geo_id=05000US56035&-geo_id=05000US56037&-geo_id=05000US56039&-geo_id=05000US56041&-geo_id=05000US56043&-geo_id=05000US56045&-search_results=01000US&-format=&-_lang=en
b) Poverty Status in 1999
http://factfinder.census.gov/servlet/DTTable?_bm=y&-context=dt&-ds_name=DEC_2000_SF3_U&-CONTEXT=dt&-mt_name=DEC_2000_SF3_U_PCT056&-tree_id=403&-redoLog=false&-all_geo_types=N&-currentselections=DEC_2000_SF3_U_P052&-geo_id=05000US56001&-geo_id=05000US56003&-geo_id=05000US56005&-geo_id=05000US56007&-geo_id=05000US56009&-geo_id=05000US56011&-geo_id=05000US56013&-geo_id=05000US56015&-geo_id=05000US56017&-geo_id=05000US56019&-geo_id=05000US56021&-geo_id=05000US56023&-geo_id=05000US56025&-geo_id=05000US56027&-geo_id=05000US56029&-geo_id=05000US56031&-geo_id=05000US56033&-geo_id=05000US56035&-geo_id=05000US56037&-geo_id=05000US56039&-geo_id=05000US56041&-geo_id=05000US56043&-geo_id=05000US56045&-search_results=01000US&-format=&-_lang=en
c) Race
http://factfinder.census.gov/servlet/DTTable?_bm=y&-context=dt&-ds_name=DEC_2000_SF1_U&-mt_name=DEC_2000_SF1_U_P007&-CONTEXT=dt&-tree_id=4001&-all_geo_types=N&-geo_id=05000US56001&-geo_id=05000US56003&-geo_id=05000US56005&-geo_id=05000US56007&-geo_id=05000US56009&-geo_id=05000US56011&-geo_id=05000US56013&-geo_id=05000US56015&-geo_id=05000US56017&-geo_id=05000US56019&-geo_id=05000US56021&-geo_id=05000US56023&-geo_id=05000US56025&-geo_id=05000US56027&-geo_id=05000US56029&-geo_id=05000US56031&-geo_id=05000US56033&-geo_id=05000US56035&-geo_id=05000US56037&-geo_id=05000US56039&-geo_id=05000US56041&-geo_id=05000US56043&-geo_id=05000US56045&-search_results=01000US&-format=&-_lang=en
d) Age
http://factfinder.census.gov/servlet/DTTable?_bm=y&-context=dt&-ds_name=DEC_2000_SF1_U&-CONTEXT=dt&-mt_name=DEC_2000_SF1_U_P012&-tree_id=4 001&-redoLog=false&-all_geo_types=N&-geo_id=05000US56001&-geo_id=05000US56003&-geo_id=05000US56005&-geo_id=05000US56007&-geo_id=05000US56009&-geo_id=05000US56011&-geo_id=05000US56013&-geo_id=05000US56015&-geo_id=05000US56017&-geo_id=05000US56019&-geo_id=05000US56021&-geo_id=05000US56023&-geo_id=05000US56025&-geo_id=05000US56027&-geo_id=05000US56029&-geo_id=05000US56031&-geo_id=05000US56033&-geo_id=05000US56035&-geo_id=05000US56037&-geo_id=05000US56039&-geo_id=05000US56041&-geo_id=05000US56043&-geo_id=05000US56045&-search_results=01000US&-format=&-_lang=en