Datasets : Harry Potter: words occuring near the word 'Lupin'

Uploaded By: dumbledad Created at: Monday May 14, 8:41 AM
Data Source: The Harry Potter series of books by J. K. Rowling
Description: All the words that occur within ten words of the word "Lupin" within the Harry Potter series. The match uses SQL Servers LIKE command so I'm hoping that includes "Lupin's" etc. Some simple stop words are removed before the dataset was gathered. The columns are the word itsel, how often it occured, how often it occurs throughout the series, the first and last book it occurs in, and the position in the series overall that it first occurs in. This is part of some work I am doing trying to use statistics and data visualization to find out more about the Harry Potter books prior to the final book coming out. I presented some of this at Accio 2005. http://research.microsoft.com/~timregan/
Tags: kids_lit fantasy books jkr harry_potter literature


View_as_text_button Edit_dataset_disabled_button
word wordCount frequencyInSeries firstBook lastBook firstPosition
1 seats 1 83 3 3 180233
2 farthest 1 3 3 3 180234
3 away 15 882 3 5 180235
4 window 3 344 3 5 180238
5 professor 148 1924 3 6 180239
6 r 2 18 3 3 180240
7 j 2 5 3 3 180241
8 lupin 659 591 3 6 180242
9 whispered 3 343 3 5 180243
10 hermione 45 3871 3 6 180244
11 once 15 871 3 5 180246
12 know 30 1790 3 6 180249
13 quantity 1 8 3 3 180278
14 neatly 1 18 3 3 180280
... ... ... ... ... ... ...
Watch_this_disabled Add_to_topic_hub_disabled Visualize_button Rate_this_disabled

Not_rated_big This data set
has not yet been rated

Visualizations of this data set

Gray_megaphone_small Part of these topic hubs

Gray_binoculars_small Being watched by

Versions (1)

1. Original Data Set by dumbledad on May 14 2007

Comments (0)

Post a comment as Anonymous

Please verify that you are human

simple_captcha.jpg
(type the code from the image)