Python Data Science Cookbook (Pythonデータサイエンスクックブック)

(Second read-through)

Recipe 7.1

The data is available here:
https://github.com/ipython-books/cookbook-data

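Loading data/Roger-Federer.csv gives a fairly large table: 70 columns and 1179 rows. Simply evaluating df abbreviates the columns and does not show all of them, so I ran

for col in df.columns:
    print(col)

to list every column name.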

    year: year the tournament was held
    tournament: tournament name
    start date: tournament start date
    type
    surface
    draw
    atp points
    atp ranking
    tournament prize money
    round
    opponent
    ranking
    score
    stats link
    tournament.1
    tournament round
    time
    winner: winner of the match
    player1 name
    player1 nationality
    player1 aces
    player1 double faults: number of double faults
    player1 1st serves in
    player1 1st serves total
    player1 1st serve points won
    player1 1st serve points total
    player1 2nd serve points won
    player1 2nd serve points total
    player1 break points won
    player1 break points total
    player1 service games played
    player1 1st serve return points won
    player1 1st serve return points total
    player1 2nd serve return points won
    player1 2nd serve return points total
    player1 break points converted won
    player1 break points converted total
    player1 return games played
    player1 total service points won
    player1 total service points total
    player1 total return points won
    player1 total return points total
    player1 total points won
    player1 total points total: total number of points
    player2 name
    player2 nationality
    player2 aces
    player2 double faults
    player2 1st serves in
    player2 1st serves total
    player2 1st serve points won
    player2 1st serve points total
    player2 2nd serve points won
    player2 2nd serve points total
    player2 break points won
    player2 break points total
    player2 service games played
    player2 1st serve return points won
    player2 1st serve return points total
    player2 2nd serve return points won
    player2 2nd serve return points total
    player2 break points converted won
    player2 break points converted total
    player2 return games played
    player2 total service points won
    player2 total service points total
    player2 total return points won
    player2 total return points total
    player2 total points won
    player2 total points total
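
The loading and column listing above can be sketched roughly as follows. This is only a minimal sketch, assuming pandas is used as in the book. A tiny in-memory CSV with placeholder rows stands in for data/Roger-Federer.csv so that the snippet runs on its own, and the helper name `list_columns` is hypothetical.

```python
import io

import pandas as pd

# Tiny stand-in for data/Roger-Federer.csv (assumption: placeholder rows,
# only three of the 70 real columns are reproduced here).
SAMPLE_CSV = """year,tournament,winner
1998,Example Open,Roger Federer
1999,Example Cup,Some Opponent
"""


def list_columns(df):
    """Return every column name as a plain list, so nothing is elided."""
    return list(df.columns)


# With the real data: df = pd.read_csv("data/Roger-Federer.csv")
df = pd.read_csv(io.StringIO(SAMPLE_CSV))

print(df.shape)  # (number of rows, number of columns)
for col in list_columns(df):
    print(col)
```

As an alternative to the loop, `pd.set_option("display.max_columns", None)` lifts the column display limit in pandas, so evaluating `df` directly should show every column.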

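Step 10
This step plots the ratio of double faults to total points as a time series. Judging from the line chart, the double-fault rate appears to decline gradually over the years.
f:id:bitop:20160313085017p:plain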
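
The ratio described above can be computed roughly like this. It is a minimal sketch, assuming the column names from the list above (`year`, `player1 double faults`, `player1 total points total`). The rows are made-up stand-ins rather than real match data, and `double_fault_ratio_by_year` and `plot_ratio` are hypothetical helper names. Averaging per year is my own simplification to make the trend easier to read; the book's code may aggregate differently.

```python
import pandas as pd

# Made-up rows for illustration only (not real match statistics).
df = pd.DataFrame({
    "year": [2000, 2000, 2001, 2001],
    "player1 double faults": [6, 4, 3, 1],
    "player1 total points total": [100, 100, 100, 100],
})


def double_fault_ratio_by_year(df):
    """Mean per-match ratio of double faults to total points, per year."""
    ratio = df["player1 double faults"] / df["player1 total points total"]
    return ratio.groupby(df["year"]).mean()


def plot_ratio(yearly):
    """Draw the yearly ratio as a line chart (requires matplotlib)."""
    import matplotlib.pyplot as plt

    fig, ax = plt.subplots()
    ax.plot(yearly.index, yearly.values, marker="o")
    ax.set_xlabel("year")
    ax.set_ylabel("double faults / total points")
    return ax


yearly = double_fault_ratio_by_year(df)
print(yearly)
# plot_ratio(yearly)  # uncomment to draw the chart
```

Dividing by total points instead of plotting the raw count keeps long matches from looking worse than short ones just because more points were played.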