Mar
15

Choosing the Right College

Is it worth paying more for a “top 10” school?

You get accepted to a private school / Ivy League that will cost a fortune. You’re also accepted by a public university which doesn’t have as good a reputation but costs much less. Does it matter which one you choose?


One factor in your decision is if the school will increase your chance of being successful. A school could be ‘worth it’ if it produces successful people, which we define as its alumni appearing in Wikipedia.


To answer this question, we identified the number of ‘successful’ graduates for each college (defined as the alumni appearing in Wikipedia). We calculated the likelihood of appearing in Wikipedia if one was an alumni from a given college.

Method Details:
The likelihood of a college alumnus appearing in Wikipedia is calculated as a relative ratio.
If the relative ratio is 1, then that means that the number of alumnus observed in Wikipedia follows what’s expected based on the college size.
If the relative ratio is greater than 1, then that means that the number of people in Wikipedia is higher than what’s expected base on its college size — and this school increases your chance of success.
Equations here

The table below shows the colleges with the most Wikipedia enrichment.

For example, Harvard has a relative ratio of 50, which means that alumni are 50x more likely to appear in Wikipedia than expected.


CollegeEnrichment in Wikipedia
American Conservatory Theater124.63
Harvard College50.38
Curtis Institute of Music59.97
Columbia University33.50
Juilliard School33.90
Yale University24.37
San Francisco Art Institute25.80
Princeton University20.05
Manhattan School of Music16.17
New England Conservatory of Music16.12
California Institute of the Arts14.62
Stanford University12.06
California Institute of Technology12.96
Massachusetts Institute of Technology11.27
Swarthmore College12.14
Northwestern University10.23
Amherst College11.03
Golden Gate University12.22
Williams College9.99
Bennington College10.70
Trinity College10.26
Cleveland Institute of Music12.57
Shimer College14.20
Johns Hopkins University8.20
University of Chicago7.96
Brown University7.96
Sarah Lawrence College8.83
Cooper Union for the Advancement of Science and Art9.25
Vassar College8.39
Columbia College8.71
Duke University7.45
Dartmouth College7.65
Wesleyan University7.87
Berklee College of Music7.43
Goddard College8.66
Oberlin College7.53
Rhode Island School of Design7.68
Georgetown University6.36
Pomona College7.04
Haverford College7.25
Reed College7.05
San Francisco Conservatory of Music8.36
Brandeis University5.82
University of Pennsylvania5.36
Cornell University5.30
Barnard College5.89
Bowdoin College5.99
National Defense University6.69
University of California Berkeley4.95
University of Southern California4.77
University of California Los Angeles4.75
St. John's College6.72
American University4.92
Davidson College5.56
Art Center College of Design5.49
Occidental College5.28
Wellesley College5.16
Bard College5.17
Hastings College5.57
Howard University4.67
University of Notre Dame4.50
Pontifical College Josephinum8.50
University of Michigan3.99
Smith College4.55
University of Rochester4.15
School of Visual Arts4.39
University of Miami4.04
Westminster College5.02
Marietta College4.78
University of Virginia3.90
New York University3.80
Middlebury College4.45
Mannes College of Music4.84
Southern Methodist University3.89
United States Military Academy4.10
Union College4.56
United States Naval Academy3.99
Kenyon College4.43
Naval Postgraduate School4.43
Illinois College4.78
Boston College3.65
United States Air Force Academy3.91
Wake Forest University3.75
Boston Conservatory4.92
University of Richmond3.96
College of the Holy Cross3.99
Wheaton College3.97
Mitchell College4.52
Colgate University3.93
Tulane University3.47
Lake Forest College4.21
Morehouse College3.92
Hampshire College4.16
School of the Museum of Fine Arts4.58
Carleton College3.94
Mills College4.00
Bryn Mawr College3.87
Rush University6.10
Catholic University of America3.34
Grinnell College3.84
Rice University3.34
Antioch University Los Angeles6.24
Kansas City Art Institute4.28
Colorado College3.60
Carnegie Mellon University3.04
Pratt Institute3.24
Bates College3.60
Vanderbilt University2.97
Tufts University2.92
Mount Holyoke College3.27
Hamilton College3.34
St. Charles Borromeo Seminary5.24
Syracuse University2.68
School of the Art Institute of Chicago3.06
Warren Wilson College3.59
York College4.03
Lincoln University3.71
University of Texas at Austin2.38
Otis College of Art and Design3.18
Texas College3.30
Lawrence University3.05
Whitman College2.98
Millsaps College3.14
Louisiana State University2.26
University of Tulsa2.59
University of Dallas2.70
Southwestern University3.00
Fisk University3.42
Macalester College2.77
University of the Southwest3.49
Willamette University2.64



The relative ratio of appearing in Wikipedia is plotted against the U.S. News & World Report rankings. We see that people who attend the top-ranked schools do have a higher likelihood of appearing in Wikipedia.

alt text

So this supports going to a top-rank school. Also notice what happens after rank 40, the college doesn’t seem to matter for getting into Wikipedia.

Full analysis with gory details

Also, check out my earlier post which show that you don’t even need to go to college for certain professions.

Feb
26

Is going to college worth it?

College is expensive. Students are graduating with massive debts that take the rest of their lives to pay off. Is it worth it? Bill Gates and Steve Jobs never graduated from college, so perhaps a college degree isn’t even necessary. But are Bill Gates, Steve Jobs, Mark Zuckenberg, and Joi Ito rarities in this world, or is this a more general trend?

Let’s analyze Wikipedia for some insight to this crucial question. To help with this task, I’ve created a tool that examines 100,000 biographies of notable individuals born between 1930-1980. This article summarizes my findings.

I’ve divided the results into 5 categories of notable individuals:

  1. Entertainers/Artists (famous singers, writers, etc.)
  2. Athletes, e.g. NBA and NFL professional athletes
  3. Politicians e.g. senators, presidents, protestors
  4. Business people e.g. Warren Buffet, Steve Jobs
  5. Academic nerds e.g. engineers, computer programmers, etc.

Note: A person in Wikipedia can be in more than one category. On his Wikipedia page, Bill Gates is categorized as being both an “American computer programmer” and as “Businesspeople from Seattle.”

College Education by Occupation

59% of the Americans in Wikipedia have no college information on their Wikipedia biography.
That’s surprisingly large.

Is it because Wikipedia simply lacks biographical entries on college education? Probably not. One would expect that almost all notable academics would have a college degree. 79% of the biographies of academics have higher education in their biographies – so perhaps the remaining 21% have incomplete biographical records. Assuming this is a valid proxy for under-reporting higher education, one might guess that, as a lower bound, 59% – 21% = 38% of Wikipedians don’t have a college education.

Furthermore, it stands to reason that if a college education was relevant to a Wikipedian’s achievements, the editorial community would include it in the biographical record. Therefore, even if these results are biased by an under-reporting of college information, it is still a valid indicator of the relative importance of college education to an individual’s notability.

Finally, the relative percentage of individuals with college education is consistent with expectations for the five occupations:


  • Athletes and entertainers/artists don’t require a college education to be successful (70% of the athletes and 60% of entertainers/artists don’t have college educations).
  • About half of business people went to college, but half did not.

College Rates over Time

Given the social pressure to go to college over the last few decades, one would expect that the fraction of people in Wikipedia with a college education would likewise rise over the same time preiod.

Instead, we find that education rates for people in Wikipedia have remained constant over time, despite a general trend in society toward higher education.

The dashed upward-trending line in the graph above shows how the general population is being convinced to go to college; the flat or downward-trending lines for education rates in Wikipedia show that college education has had little to no bearing on one’s accomplishments. In fact, in some disciplines it seems going to college hampers one’s ability to achieve success.

Business people show a small but significant decline in education over time (p = 0.003). Interestingly, the decline in education for businesspeople starts for those born in the 1960’s. This would correspond to being educated in the mid-1980’s which coincides with a recession where college tuition may have been impractical. It also coincides with the development of the World Wide Web which could have created other opportunities.




Entertainers can become successful at a young age without needing further education.
Just look at the Disney child actors that transition into adult roles.

Successful athletes have been showing increased college education rates over the years. This is probably a result of social pressure on college athletics programs to educate their athletes, in addition to using them to fill stadiums and power a money making machine. However, attendance may not mean graduation with a Bachelor’s degree because some athletes choose to leave college and become a professional athlete before graduation. Furthermore, athletes have been given passing grades in classes they never attended, so the ‘education’ aspect of college may be missing.

So is going to college worth it?

If you already know what you want to be – and you’ve proven you have a knack for it – then you may be better off to keep on doing what you’re doing, and skipping the college debt. Just ask the child actors, professional athletes, or all the young entrepreneurs running successful businesses without college degrees.

Jun
02

Mining Wikipedia paper at ICWSM 2012

Britney Spears and Kobe Bryant at VMA Yay! My paper entitled “What Britney Spears and Kobe Bryant Have in Common: Mining Wikipedia for Characteristics of Notable Individuals” was accepted at ICWSM 2012

The pdf can be downloaded here:
Mining Wikipedia For Characteristics of Notable Individuals.pdf

So what do Britney and Kobe have in common? They’re both successful, and my research shows that having a rare name increases the chance of success.

Wait — you say, Britney is a common name! Not so — when Britney was born in 1981, her name was far down the list of popular names — #758 as a matter of fact. So in her age group, her name was quite rare, and that really distinguished her from other musicians. I remember listening to those albums back then, people always said “Christina Aguilera”, but when you said “Britney”, everyone knew you were talking about the one-and-only Ms. Spears. Only later, when she gained immense popularity, did her name become common as parents started naming their daughters “Britney” (the name Britney rose to rank #137 in 2000).

When Britney was becoming a star, her uncommon name helped her. This is not surprising for entertainers, but according to my research, this observation holds for athletes and successful people in general.
And if you don’t have an uncommon name, then my research shows that using a nickname also helps. Think ‘Steve’ Jobs.

I also looked at birth locations. If you’re born in California or New York, you’re 2x more likely to become an entertainer. Not too surprising, because of Hollywood & Broadway. If you’re born in the South, there is increased chance of becoming an athlete.

This isn’t to say that if you have a common name or you weren’t born in these states, there’s no chance you will become famous. It just shows there is an enrichment for these characteristics.

So if you have a common name, try using a nickname!

Results:

  • People with rare names more than 2x likely to appear in Wikipedia (2.43x for women; 2.30x for men). [More]
  • People with nicknames are also more likely to be in Wikipedia. Males with nicknames are 2.39x more likely to appear in Wikipedia while for females it’s a 1.32x increase
  • Individuals born in New York and California are ~2x more likely to become entertainers, and those born in the South are ~1.5x more likely to become athletes.[More]

There’s a lot of data in Wikipedia, it can be mined for much much more. This paper describes a couple of features — more associations can be gleaned in the future.

 

Sep
25

Names and Birth States Found Frequently in Wikipedia

Oprah Winfrey, a successful person with an uncommon name.
Oprah Winfrey, a successful person with an uncommon name.
Bill Gates , Microsoft founder and philanthropist. Born as William Gates, but everyone calls him Bill.

Bill Gates , Microsoft founder and philanthropist.  Real name is William Gates, but commonly called Bill.

What makes a person successful? As parents, we try to make the best choices to help our children become successful and happy.

One of the first things that we decide on when having a baby is the child’s name. Another choice is where to live. Are these relevant in determining a child’s future success?

Wikipedia is full of successful people. I looked at the characteristics of people in Wikipedia to see if they are any different from the average population.

I looked to see if certain names and birthplaces occur in Wikipedia more often than expected . Click here for more details on the analysis.

Analysis on Names in Wikipedia

  • Rare names are enriched in Wikipedia. Names that are less than 1% frequent in the population are 2x more likely to appear in Wikipedia, regardless of gender.
  • If born with a common name, you’re more likely to appear in Wikipedia if you use a nickname rather than the formal name given at birth. For example, Michael appears in Wikipedia 42% less than expected, but its corresponding nickname “Mike” appears 9.7x more frequently than expected in Wikipedia.
  • Visualize names here

    Analysis of Birth States in Wikipedia

  • More entertainers/artists are born in California and New York (~2-fold enrichment)
  • More athletes are born in the Southern states (~1.5-fold enrichment)
  • Visualize all states here

    Download the source code:

    Code for analyzing Wikipedia biographies

    May
    14

    Is it safe to visit this country?

    Have you ever wondered whether it was safe to go to a certain country? Last March, I had a conference in the Middle East and wanted to visit Lebanon as a side trip.   But family was saying no, it’s dangerous while  travel forums were saying it was safe.   I was confused with all the conflicting information and I wanted unbiased facts — how safe was it to visit Lebanon?

    The Foreign & Commonwealth Office in UK has an amazing amount of statistics on their UK citizens that travel. Because they publish the number of annual tourists AND how many died or  were hospitalized,  I could calculate the danger rate by simple division:

    Safety risk = # of hospitalizations & deaths for tourists / total # of tourists

    The safety risk for visiting each country is pretty low overall. For example, according to my numbers, the most dangerous country to visit is Philippines, where about 1 in 1,000 tourists will run into some trouble. However, even though this is a relatively small number, you could ask, how safe is visiting the Philippines compared to say, visiting the United States? By calculating the relative risk, it is ~19 times more dangerous to visit the Philippines than to visit the United States.

    Safety risk includes the number of deaths and hospitalizations. I also looked at the rate of “major incidents” which also includes arrests, assaults, and missing persons as well as deaths and hospitalizations.  Because nothing spoils a vacation like going to jail.

    Surprisingly, while the U.S. is a safe place to go (in the top 10 of countries with a lower chance of dying or being hospitalized as a tourist), a lot of Brits were arrested in the U.S.  You’re 6 times more likely to be arrested in the U.S. than in China!

    CountrySafety Rank Major Incident Rank Safety Risk relative to visiting U.S.Major Incident Risk relative to visiting U.S.Safety RateMajor Incidents Rate
    Austria210.440.102.2E-053.0E-05
    Belgium120.400.132.0E-053.9E-05
    France530.780.163.9E-054.7E-05
    Hungary340.660.163.3E-054.8E-05
    Latvia450.660.163.3E-054.9E-05
    Mauritius1161.010.205.0E-056.0E-05
    Italy1271.040.215.2E-056.1E-05
    Albania980.950.214.7E-056.3E-05
    Oman1391.190.245.9E-057.0E-05
    Netherlands6100.850.254.2E-057.5E-05
    Czech Republic14111.200.256.0E-057.5E-05
    Romania16121.340.316.7E-059.2E-05
    Slovenia7130.890.334.4E-059.9E-05
    Singapore15141.250.376.2E-051.1E-04
    Croatia20151.880.379.3E-051.1E-04
    Switzerland17161.680.388.4E-051.1E-04
    Turkey23171.970.429.8E-051.2E-04
    Uruguay37182.690.451.3E-041.3E-04
    Estonia18191.690.478.4E-051.4E-04
    China24202.070.481.0E-041.4E-04
    Bangladesh20211.880.499.3E-051.5E-04
    Egypt34222.560.511.3E-041.5E-04
    Malaysia28232.320.511.1E-041.5E-04
    Russian Federation38243.030.531.5E-041.6E-04
    Fiji32252.420.541.2E-041.6E-04
    St Lucia8260.900.554.4E-051.6E-04
    Pakistan22271.900.579.4E-051.7E-04
    Ukraine30282.390.601.2E-041.8E-04
    Australia40293.640.611.8E-041.8E-04
    Portugal43303.910.671.9E-042.0E-04
    New Zealand26312.240.681.1E-042.0E-04
    Morocco36322.670.751.3E-042.2E-04
    Lebanon48334.550.762.3E-042.3E-04
    South Africa41343.830.771.9E-042.3E-04
    Malta44354.030.772.0E-042.3E-04
    Brazil33362.450.841.2E-042.5E-04
    Indonesia49374.570.922.3E-042.7E-04
    Mexico42383.900.981.9E-042.9E-04
    Barbados35392.660.981.3E-042.9E-04
    Sweden29392.380.981.2E-042.9E-04
    United States10411.001.005.0E-053.0E-04
    Denmark27422.291.031.1E-043.1E-04
    Dominican Republic52435.151.042.6E-043.1E-04
    Costa Rica46444.201.052.1E-043.1E-04
    Taiwan25452.121.061.1E-043.2E-04
    Mongolia62466.501.093.2E-043.2E-04
    Sri Lanka54475.311.132.6E-043.3E-04
    Ghana50484.611.152.3E-043.4E-04
    United Arab Emirates19491.751.198.7E-053.5E-04
    India58506.101.203.0E-043.6E-04
    Spain45514.061.202.0E-043.6E-04
    Greece55525.321.212.6E-043.6E-04
    Germany59536.171.243.1E-043.7E-04
    Ecuador47544.361.462.2E-044.3E-04
    Japan31552.411.461.2E-044.3E-04
    Norway51564.841.512.4E-044.5E-04
    Botswana53575.311.592.6E-044.7E-04
    Vietnam68589.831.644.9E-044.9E-04
    Cyprus63596.751.653.3E-044.9E-04
    Peru39603.611.761.8E-045.2E-04
    Uganda56615.381.802.7E-045.3E-04
    Cambodia64626.851.803.4E-045.3E-04
    Brunei706310.861.815.4E-045.4E-04
    Colombia65647.011.903.5E-045.7E-04
    Kenya66657.491.953.7E-045.8E-04
    Zambia67668.402.244.2E-046.7E-04
    Belarus696710.082.535.0E-047.5E-04
    Qatar57686.052.693.0E-048.0E-04
    Thailand726913.263.126.6E-049.3E-04
    Ethiopia61706.303.583.1E-041.1E-03
    Venezuela60716.203.633.1E-041.1E-03
    Philippines737218.823.789.3E-041.1E-03
    Belize717311.863.965.9E-041.2E-03

     

    Caveats aka Grain of Salt:

    • Numbers are from April 1, 2009-March 31, 2010 so if there was some out-of-the-ordinary event in the country during that time, it could inflate numbers
    • Tourists = UK citizens travelling  & living abroad.   I don’t know why Thailand is so dangerous — I’ve been there myself and want to go back again, but if UK citizens are retiring there, then it would make sense they would die there (and this increase the danger rate.)