We ran logistic regression on about 100 different variables ranging from the prefix of the projects teachers name e.g. Mr. or Ms. to the cost of the project. The purpose was to attempt to predict the success of the project i.e. whether it achieved full funding based on the known variables when the project was purposed. Logistic regression learns weights for each different variable in order to makes its prediction, essentially determining which variables are important factors for predicting the future success or failure of the project. The importance of the variable is expressed as an odds ratio, that is, the probability of success given the information the variable provides divided by the probability of failure given the same variable. Basically, an odds ratio of greater than one indicates the variable is generally indicative of a successful project whereas a value of less that one means the opposite. The odds ratio for all the variables along with the raw output from Weka (the software used to run logistic regression) is attached to this page, however we are presenting the 25 most significant variables. The results from the logistic regression analysis generally agree with the results of the clustering analysis, that is, the Southern states appear to be at significant disadvantage to the Northeastern and Western states. Also, projects requesting books and trips are stronger than those requesting technology As for subjects, Sports, Economics, and Nutrition outweigh Character Education, Early Development, and Parent Involvement. Finally, projects with matching donations were some of the strongest. Finally, logistic regression was able to accurately predict the success or failure of a project 72% of the time.
Top 25 Most Significant Variables
In short, the values greater than 1 mean it indicates success whereas values less that 1 mean the opposite.
| Variables indicating likely success | Odds Ratio |
|---|---|
| Resource Requested: Trips | 1.5730690577 |
| Primary Subject: Sports | 1.5330369462 |
| Primary Subject: Economics | 1.5156107911 |
| Resource Requested: Books | 1.4788524105 |
| Secondary Subject: Nutrition | 1.4518002323 |
| High Provery Project | 1.446131598 |
| Primary Subject: Environmental Science | 1.4459224986 |
| Matching | 1.4371945961 |
| Primary Subject: Music | 1.4361625736 |
| Western States | 1.3168290756 |
| School Setting: Urban | 1.2529758176 |
| Northeastern States | 1.2363996044 |
| Variables indicating likely failure | Odds Ratio |
|---|---|
| Primary Area: Applied Learning | 0.8697921197 |
| Secondary Subject: Other | 0.8628872206 |
| Resource Requested: Visitors | 0.8602150538 |
| Secondary Subject: Character Education | 0.8593280055 |
| Primary Subject: ESL | 0.852514919 |
| Project usable by future students | 0.8437394533 |
| Primary Subject: Parent Involvement | 0.8329862557 |
| Primary Subject: Early Development | 0.8168599902 |
| Primary Subject: Other | 0.8130742337 |
| Primary Subject: Character Education | 0.8043758044 |
| School Setting: Rural | 0.7971303308 |
| Southern States | 0.7432733759 |
| Resource Requested: Technology | 0.7119464616 |
