The raw data behind the story "'Straight Outta Compton' Is The Rare Biopic Not About White Dudes" https://fivethirtyeight.com/features/straight-outta-compton-is-the-rare-biopic-not-about-white-dudes/. An analysis using this data was contributed by Pradeep Adhokshaja as a package vignette at http://fivethirtyeight-r.netlify.com/articles/biopics.html.
A data frame with 761 rows representing movies and 14 variables:
Title of the film.
Text to construct IMDB url. Ex: http://www.imdb.com/title/tt1711425
Country of origin.
Year of release.
Gross earnings at U.S. box office.
Director of film.
The number of subjects featured in the film.
The actual name of the featured subject.
The occupation of subject or reason for recognition.
Indicates whether the subject's race was discernible based on background of self, parent, or grandparent.
Race of the subject.
Dummy variable that indicates person of color.
Sex of subject.
The actor or actress who played the subject.