We applied a well known clustering algorithm called K-Means to explore the project data at both the national and regional levels. The premise was to find interesting groupings of projects that shared common features. We used converted all the categorical variables into indicator variables so the Euclidean distance measure used by K-Means would be meaningful. In other words, the K-Means algorithm essentially counted the number of differences between each project to determine which were similar to each other. It should be noted that the cluster analysis was done on the April 2011 data.

The resulting clusters highlighted some regional differences on the national level and some patterns of unusual giving within each region. For instance, there was a dichotomy between some projects in the Western United State and those in the South. Consider the following clusters:

West vs South

Variable West (Cluster 2) South (Cluster 11) Nation Average
Essential items? 37.7% 57.8% 51.9%
Average Students Reached 145.5 158.2 100.8
Average Project Cost $748.17 $748.32 $535.68
Number of Projects 17732 17606 N/A
Success Rate 68.8% 0.0% 70.6%

These two clusters are largely the same except they are from different regions. Both are mostly requesting technology, however the success rate for the South is much lower. Furthermore, consider these two clusters from the Northeast and South:

East vs South

Variable East (Cluster 6) South (Cluster 5) Nation Average
Type of request Mostly technology Mostly basic supplies N/A
Essential items? 4.0% .01% 51.9%
Average Students Reached 135.3 94.7 100.8
Average Project Cost $1163.74 $412.58 $535.68
Number of Projects 20161 30088 N/A
Success Rate 72.5% 66.8% 70.6%

Again, the South appears to be not as successful at attaining funding. The cluster from the Northeast has a higher success rate when requesting non-essential items over twice as much in value as the South. However, a cluster from the South requesting cheap technology is one of the two most successful projects:

Most Successful clusters

Variable West (Cluster 4) South (Cluster 8) Nation Average
Type of request Books Technology N/A
Essential items? 69.9% 44.0% 51.9%
Average Students Reached 72.3 99 100.8
Average Project Cost $427.68 $471.82 $535.68
Number of Projects 14430 13102 N/A
Success Rate 81% 100.0% 70.6%

National Results

Full Data Cluster 1 Cluster 2 Cluster 3 Cluster 4 Cluster 5 Cluster 6 Cluster 7 Cluster 8 Cluster 9 Cluster 10 Cluster 11 Cluster 12
Cluster Size 271384 50922 17732 18348 14430 30088 20161 37136 13102 11900 39959 17606
West 17.71% 0.00% 100.00% 46.51% 100.00% 0.00% 0.00% 19.84% 0.00% 0.00% 0.00% 0.00%
Southwest 4.38% 0.00% 0.00% 0.00% 0.00% 0.00% 0.00% 0.00% 0.00% 100.00% 0.00% 0.00%
South 33.37% 0.00% 0.00% 0.00% 0.00% 100.00% 0.00% 80.16% 100.00% 0.00% 0.00% 100.00%
Midwest 18.76% 100.00% 0.00% 0.00% 0.00% 0.00% 0.00% 0.00% 0.00% 0.00% 0.00% 0.00%
Northeast 25.77% 0.00% 0.00% 53.49% 0.00% 0.00% 100.00% 0.00% 0.00% 0.00% 100.00% 0.00%
High Poverty 83.67% 82.67% 79.65% 84.52% 75.79% 82.19% 87.87% 88.39% 81.93% 72.87% 89.69% 78.82%
Books 23.73% 23.66% 0.00% 0.00% 76.52% 28.83% 26.50% 22.34% 0.00% 27.26% 30.25% 20.77%
Technology 28.25% 29.02% 82.18% 0.00% 0.00% 0.00% 50.03% 0.00% 100.00% 28.84% 21.52% 68.69%
Other Resource 9.70% 9.37% 13.17% 0.00% 13.18% 17.49% 18.06% 6.93% 0.00% 9.26% 7.37% 10.12%
Supplies 36.99% 36.79% 0.00% 100.00% 9.87% 52.00% 0.00% 70.69% 0.00% 34.35% 39.76% 0.00%
Essential Materials 51.93% 51.16% 37.77% 0.70% 69.94% 0.74% 4.04% 99.14% 44.00% 52.13% 94.93% 57.88%
Average Number of Students 101 87 146 89 72 95 135 104 99 94 76 158
Average Project Cost 536 474 748 410 428 413 1164 388 472 464 477 748
Fully Funded 70.56% 69.42% 68.80% 79.65% 81.05% 66.82% 72.51% 79.57% 100.00% 74.11% 78.66% 0.00%

Western States

Full Data Cluster 1 Cluster 2 Cluster 3 Cluster 4 Cluster 5 Cluster 6 Cluster 7 Cluster 8 Cluster 9 Cluster 10
Cluster Size 48062 6840 4238 2947 4310 5801 3451 6732 6671 7072
High Poverty 81.80% 82.09% 81.57% 77.23% 83.23% 81.07% 76.73% 85.19% 78.95% 85.24%
Books 22.97% 0.00% 0.00% 0.00% 100.00% 0.00% 0.00% 100.00% 0.00% 0.00%
Technology 30.32% 0.00% 0.00% 91.01% 0.00% 89.97% 0.00% 0.00% 100.00% 0.00%
Other Resource 8.82% 0.00% 100.00% 0.00% 0.00% 0.00% 0.00% 0.00% 0.00% 0.00%
Supplies 36.05% 100.00% 0.00% 0.00% 0.00% 0.00% 99.77% 0.00% 0.00% 99.56%
Essential Materials 50.38% 0.29% 44.67% 0.03% 0.08% 0.62% 53.26% 99.82% 100.00% 99.45%
Average Number of Students 100 74 68 431 75 89 86 67 95 77
Average Project Cost 540 368 447 1157 388 605 553 433 806 387
Fully Funded 76.43% 100.00% 77.16% 0.00% 82.81% 100.00% 0.00% 83.50% 68.37% 100.00%

Southwest States

Full Data Cluster 1 Cluster 2 Cluster 3 Cluster 4 Cluster 5 Cluster 6 Cluster 7 Cluster 8 Cluster 9
Cluster Size 11900 941 1875 637 1983 1868 1102 1281 2213
High Poverty 72.87% 81.62% 70.93% 0.00% 76.30% 85.12% 75.23% 74.71% 76.14%
Technology 28.84% 99.47% 0.00% 98.59% 0.00% 100.00% 0.00% 0.00% 0.00%
Supplies 34.35% 0.00% 100.00% 0.00% 0.00% 0.00% 0.00% 0.00% 100.00%
Books 27.26% 0.00% 0.00% 0.00% 100.00% 0.00% 0.00% 98.44% 0.00%
Other Resource 9.26% 0.00% 0.00% 0.00% 0.00% 0.00% 100.00% 0.00% 0.00%
Essential Materials 52.13% 53.35% 0.00% 0.00% 100.00% 54.01% 45.01% 0.00% 100.00%
Average Number of Students 94 100 92 96 70 114 73 82 112
Average Project Cost 464 925 376 625 407 516 391 394 378
Fully Funded 74.11% 0.00% 77.33% 57.46% 78.97% 100.00% 72.05% 81.42% 78.26%

South

Full Data Cluster 1 Cluster 2 Cluster 3 Cluster 4 Cluster 5 Cluster 6 Cluster 7 Cluster 8 Cluster 9 Cluster 10 Cluster 11 Cluster 12
Cluster Size 90565 20628 5516 6196 3361 6399 16431 9250 2368 9616 295 10505
High Poverty 82.59% 82.98% 78.59% 81.67% 0.00% 60.18% 100.00% 100.00% 0.00% 83.29% 81.36% 100.00%
Books 22.78% 100.00% 0.00% 0.00% 0.00% 0.00% 0.00% 0.00% 0.00% 0.00% 0.00% 0.00%
Technology 27.82% 0.00% 0.00% 0.00% 0.00% 99.97% 100.00% 0.00% 100.00% 0.00% 0.00% 0.00%
Other Resource 10.62% 0.00% 0.00% 0.00% 0.00% 0.00% 0.00% 0.00% 0.00% 100.00% 0.00% 0.00%
Supplies 38.13% 0.00% 95.07% 99.64% 99.85% 0.00% 0.00% 100.00% 0.00% 0.00% 0.00% 100.00%
Essential Materials 50.36% 58.62% 0.00% 99.74% 48.36% 74.15% 29.58% 3.11% 38.07% 45.63% 7.83% 100.00%
Average Number of Students 113 81 109 101 109 114 161 97 110 86 64 153
Average Project Cost 476 386 633 489 321 852 595 344 448 431 999 345
Fully Funded 62.76% 70.06% 0.00% 0.00% 100.00% 0.00% 65.33% 100.00% 100.00% 61.09% 98.98% 100.00%

Northeast

Full Data Cluster 1 Cluster 2 Cluster 3 Cluster 4 Cluster 5 Cluster 6 Cluster 7 Cluster 8 Cluster 9
Cluster Size 69935 6585 5279 20817 6586 5342 3766 9414 12146
High Poverty 88.90% 88.03% 84.20% 90.73% 88.41% 87.98% 83.72% 88.50% 90.86%
Books 24.92% 0.00% 0.00% 0.00% 0.00% 100.00% 0.00% 0.00% 99.51%
Supplies 36.75% 0.00% 98.30% 98.54% 0.00% 0.00% 0.00% 0.00% 0.00%
Technology 26.72% 0.00% 0.00% 0.00% 88.51% 0.00% 91.37% 100.00% 0.00%
Other Resource 9.42% 100.00% 0.00% 0.00% 0.00% 0.00% 0.00% 0.00% 0.00%
Essential Materials 55.83% 50.38% 58.33% 57.92% 0.00% 0.00% 1.07% 96.17% 94.65%
Average Number of Students 97 86 92 82 240 72 102 101 58
Average Project Cost 667 2142 563 373 608 382 1115 682 424
Fully Funded 76.85% 73.49% 0.00% 100.00% 100.00% 84.16% 0.00% 71.14% 84.87%

Midwest

Full Data Cluster 1 Cluster 2 Cluster 3 Cluster 4 Cluster 5 Cluster 6 Cluster 7 Cluster 8 Cluster 9 Cluster 10
Cluster Size 50922 4230 4857 4773 12440 1245 11357 2107 6296 3617
High Poverty 82.67% 100.00% 100.00% 81.75% 84.50% 100.00% 81.49% 0.00% 85.59% 74.84%
Books 23.66% 92.03% 99.59% 0.00% 0.00% 99.60% 0.00% 98.58% 0.00% 0.00%
Supplies 36.79% 0.00% 0.00% 0.00% 100.00% 0.00% 0.00% 0.00% 100.00% 0.00%
Technology 29.02% 0.00% 0.00% 0.00% 0.00% 0.00% 99.99% 0.00% 0.00% 94.64%
Other Resource 9.37% 0.00% 0.00% 100.00% 0.00% 0.00% 0.00% 0.00% 0.00% 0.00%
Essential Materials 51.16% 0.08% 99.70% 46.02% 80.39% 99.96% 57.35% 57.88% 0.51% 0.01%
Average Number of Students 87 75 69 76 91 64 95 88 94 99
Average Project Cost 474 420 382 429 417 571 562 400 345 881
Fully Funded 69.42% 81.96% 100.00% 67.76% 57.94% 0.00% 78.06% 67.58% 100.00% 0.00%