“A million dollars isn’t cool. You know what’s cool? A billion dollars.” — The Social Network (2010)
AirBnB completed an initial public offering in December 2020 and currently boasts a market capitalization of $100 billion. While I sadly missed out on this massive economic upside (hopefully you didn’t!), I am an active user and host of the AirBnB platform.
Naturally, I look to maximize value from the platform in both capacities. As part of my Udacity Introduction to Data Science NanoDegree, I chose to work with AirBnB Data from the Boston area to answer some questions to help assist in this mission.
Please note that I am considering moving to Boston (it is also conveniently one of the datasets provided by Udacity :D). Find three key questions that I am seeking to explore in this dataset below:
- What neighborhoods in Boston have the best value for money?
- What factors impact price per lodging?
- What factors impact ratings on lodgings?
With regards to the first question, if you are looking to stay in the Boston area, the neighborhoods West Roxbury and Roslindale appear to offer the best value for money.
The below chart calculates the average price and rating per lodging grouped by neighbourhood:
With regards to the second question, it appears that hosts with lower acceptance rates, fast response times and lodgings within certain neighborhoods have the highest correlation with prices. Frankly, I think that low acceptance rates / fast response times are indicative of hosts that take a lot of pride in their lodging, and therefore, they are able to charge a premium for their lodgings, however, this would need to be studied further.
The below chart shows the variables that are most highly correlated to prices per lodging:
With regards to the third question, it appears that acceptance rates have a high correlation to ratings. While 47% and 50% are lower acceptance ratings with strong correlations to ratings, on the aggregate, these acceptance rates look to be higher than the ones with high correlations to price, so perhaps hosts with higher acceptance rates score higher on ratings (potentially they are less high maintenance / picky) and this general sentiment of being relaxed scores well with guests. This would clearly need to be investigated further.
The below chart shows the variables that are most highly correlated to ratings per lodging:
Do you agree with these assessments? Are there other questions I should explore in future versions of this project?
Let me know and best of luck maximizing value as a user / host on the AirBnB platform!