The data awakens, from galaxy far, far away…
Since its debut in 1977, Star Wars has become a global phenomenon with millions of fan base and cult followers. Its success has hit the mainstream and has been one of the most important factor in the modern pop-culture. As a result, there are more than plenty of research and survey conducted from time to time to explore the result of modern population and their thoughts toward this franchise.
Now, let us proceed through the exploration of data and analysis in this survey…
Analysis
First, let us have a quick look at demographic detail as below…
Age | Gender | n |
---|---|---|
> 60 | Female | 118 |
> 60 | Male | 101 |
18-29 | Female | 90 |
18-29 | Male | 78 |
30-44 | Female | 117 |
30-44 | Male | 118 |
45-60 | Female | 117 |
45-60 | Male | 114 |
In Table Table 1, we can see that distribution of respondents are pleasantly surprising, in which they are well-equally distributed across most age and gender range. However, two noticeable points are distribution of Age above 60 and 18-29, which female respondent dominated that of male by some margin; as well as total number of respondent are different in between them as well.
Age | Gender | n |
---|---|---|
30-44 | Male | 81 |
45-60 | Male | 64 |
18-29 | Male | 54 |
45-60 | Female | 53 |
30-44 | Female | 50 |
> 60 | Male | 48 |
18-29 | Female | 43 |
> 60 | Female | 41 |
Age | Gender | n |
---|---|---|
> 60 | Female | 39 |
45-60 | Female | 39 |
45-60 | Male | 36 |
> 60 | Male | 29 |
30-44 | Female | 29 |
18-29 | Female | 28 |
30-44 | Male | 21 |
18-29 | Male | 17 |
Furthermore, if we classify these demographical data by fans and non-fans criteria, both in Table Table 2 and Table Table 3 with direct comparison, it appears that majority of people who considered themselves as Star Wars fans are older age male, with combined substaintially more than half of the respondent pool in fans category. On the contrary, those who considered themselves non-fans of this Sci-Fi franchise are majorly older age female. Still, it seems that almost have of the total respondents considered themselves as Star Wars fans significantly more than non-fans (n = 434 for Fans and n = 238 for Non-Fans, respectively)
Film Ranking | Fanology | Number |
---|---|---|
EP4-EP5-EP6-EP1-EP2-EP3 | Fans Chosen | 47 |
EP5-EP6-EP1-EP2-EP3-EP4 | Fans Chosen | 27 |
EP1-EP2-EP3-EP4-EP5-EP6 | Fans Chosen | 19 |
EP4-EP5-EP6-EP1-EP2-EP3 | Non-Fans Chosen | 17 |
EP5-EP6-EP1-EP2-EP3-EP4 | Non-Fans Chosen | 37 |
EP1-EP2-EP3-EP4-EP5-EP6 | Non-Fans Chosen | 51 |
Lastly, as we take a glance on the two opposite thoughts of favorite films, based on fans and non-fans from Table Table 4, we can certainly see the real pattern of what truly distinguish between fans and non-fans of the series. As EP4-EP5-EP6-EP1-EP2-EP3 seems to be the most popular choice for majority of fans, majority of non-fans has chosen EP1-EP2-EP3-EP4-EP5-EP6 instead. To add on this, being a Star Wars fans myself, this ranking by majority of non-fans seem unthoughtful; since this is perhaps the most basic and spontaneous answer anyone can come up with by ranking films by chronological order. Thus, fans respondents seems to be legit and the same goes with non-fans.
Reference
Dowle M, Srinivasan A (2021). data.table: Extension of data.frame
. R package version 1.14.2, https://CRAN.R-project.org/package=data.table.
Garrett Grolemund, Hadley Wickham (2011). Dates and Times Made Easy with lubridate. Journal of Statistical Software, 40(3), 1-25. URL https://www.jstatsoft.org/v40/i03/.
H. Wickham. ggplot2: Elegant Graphics for Data Analysis. Springer-Verlag New York, 2016.
Wickham H, Averick M, Bryan J, Chang W, McGowan LD, François R, Grolemund G, Hayes A, Henry L, Hester J, Kuhn M, Pedersen TL, Miller E, Bache SM, Müller K, Ooms J, Robinson D, Seidel DP, Spinu V, Takahashi K, Vaughan D, Wilke C, Woo K, Yutani H (2019). “Welcome to the tidyverse.” Journal of Open Source Software, 4(43), 1686. doi:10.21105/joss.01686 https://doi.org/10.21105/joss.01686.
Yihui Xie (2022). bookdown: Authoring Books and Technical Documents with R Markdown. R package version 0.28.
Zhu H (2021). kableExtra: Construct Complex Table with ‘kable’ and Pipe Syntax. R package version 1.3.4, https://CRAN.R-project.org/package=kableExtra.
Datasets
Data Description
List below are name of column (variables) and its remark with explanation of each column name meaning (Some of which are self-explanatory and can be interpreted without explanation will be omitted from including in this list)
- RespondentID - Unique number of respondent which corresponded to this survey
- Gender - Either categorized into “Male” or “Female”
- Age - including four ranges: 18-29, 30-44, 45-60 and >60 years old.
- HHI - Stands for “Household Income”, including five ranges: “$0 - $24,999”, “$25,000 - $49,999”, “$50,000 - $99,999”, “$100,000 - $149,999”, “$150,000+”
- Education - Respondents’ latest education degree
- Location - Classified by region in USA
- n - number of respondent