Você está na página 1de 4

CHALLENGE 1

1. Introduction

In this challenge you will be requested to create a standard spreadsheet that could be used
to analyze real data at Seazone, one of the largest short-term rentals management
companies in Brazil. To do so, you’ll be provided with two ‘csv’ datasets extracted directly
from our database.

The standard spreadsheet you will develop shall consist of an ‘xlsx’ file with three or four
tabs (the last one is optional). One of them must contain a customized database that you
will have to build using the two provided datasets. The other three tabs are going to be
developed entirely by you, according to some requirements of our CEO.

2. First Tab - Database

Take some time to carefully visualize the two csv files.

The first one represents a series of calendars from different properties managed by
Seazone. Although it doesn’t look like a calendar (since it is a dataset) you can notice that
each row represents a specific date of a specific property ID.

The columns of this first dataset are the following:

1 - Property ID
2 - Calendar Date
3 - Last Daily Price
4 - Occupancy (Dummy)
5 - Blocking (Dummy)
6 - Creation Date

The second dataset has some specific characteristics of each property.

The columns of this second dataset are the following:

1 - Property ID
2 - Location Code
3 - Number of Bedrooms
4 - Status
As mentioned before, this first tab must contain all the base data you will use in the next
three tabs. You will probably have to wrangle the data of the csv files and create one or
more new datasets. Feel free to customize this new set (or sets) the way you find it more
convenient. Notice that you can create new columns filled with formulae. Just be sure to
avoid irrelevant data. The cleaner the better!

3. Second Tab - Calendar

In this task you will have to ‘transform’ the calendar of the csv file into an actual calendar
that could be visualized. Our CEO requires this calendar to have a similar shape to other
multicalendars we can find in the market. Here is an example from Hostfully:

As it can be visualized, this calendar is actually a table in which the properties are rows and
the dates are columns. The calendar dates that have a specific event, such as a booking or
a blocking, can be colored. The value of each booked day can be shown as well. You can
use your creativity to extract any visual aspect of your data.

And of course, our CEO doesn’t expect us to replicate Hostfully’s calendar. But he wants to
be able to visualize the calendar situation of each property for a specific month and year
he inputs somewhere in the tab. Be sure to create these input fields!

Notice that some months have less than 31 days. It’s not a problem if the column of day 31
still shows after you input a month with 30 days. Just be sure no data is returned for the
properties in this column.
4. Third Tab - Dashboard

This specific tab is expected to be very simple, but also very powerful. Our CEO wants to
just input a period between two dates and a specific location, after that being able to
visualize some important KPIs according to these previous inputs. As an example, consider
the period between 12/28/2021 and 1/4/2022 in location JUR. We should be able to see
how this region performed around New Year’s Eve in 2021 - 2022.

The KPIs you are expected to show through a set of customized formulae (regarding just
the imputed period and location) are the following:

1 - Number of occupied dates


2 - Number of blocked dates
3 - Number of available dates
4 - Number of check-ins
5 - Average precedence of the bookings, i.e. average difference between booking creation
dates and check-in dates
6 - Total revenue of the booked dates
7 - Percentage of total revenue of the imputed region in relation to all other regions
8 - Number of dates booked during the imputed period (for future dates, staring the day
after
the last indicated date of the period)
9 - Generated revenue through the booked dates of the last topic

Use your imagination to create this dashboard the way you find it more useful and
understandable. You can create graphics and customize the cells of your dashboard if you
want, just guaranteeing you will provide our CEO with the information highlighted in the
topics before. He would surely value your creativity and objectivity!

5. Fourth Tab - Occupancy Analysis (Optional)

The last tab of the standard spreadsheet is optional. If you decide to develop it, it shall
consist of a graphical representation of the occupancy, which is one of the most important
parameters of the short-term rentals market.

You are expected to create three graphics, all of them referring to the past 30 days
(consider that the present day is 3/1/22). Differently from the previous tasks, there won’t be
a period nor a location to be imputed. The only two input data in this tab are going to be
today's date (which should be assumed as the first of March, 2022) and a unique date
specifically used in the last graphic, as we will see.
The description of the three graphics to be developed are the following:

1 - A graphic that shows how the occupancy rate varies from day to day during the last 30
days. Notice that the occupancy rate is a percentage of the total occupancy in relation to all
available calendar dates, i.e. booked properties versus all properties that aren’t blocked.
2 - A graphic that shows how many future dates were booked in each of the last 30 days.
Consider ‘future dates’ as dates that come after each day indicated in the graphic.
3 - A dynamic graphic that shows how the occupancy rates for the last 30 days were on a
different day in the past. This last graphic is very similar to the first one, since the
information level is the same. But you should be able to visualize how the occupancy rates
were for this very same period on a day that is imputed. Our CEO might want to analyze
how the situation for the last 30 days was before this period was about to start, that is, on
30/1/22 for example.

6. Final Thoughts

This challenge was developed in order to evaluate your skills on Analytics and Creativity.
You will be judged according to:

1 - the efficiency of your formulae and graphics


2 - the correct comprehension of the tasks
3 - the organization of the spreadsheet
4 - the visual quality of the tabs
5 - extra functionalities you develop that improve the spreadsheet

Notice that almost all your tasks are going to be based on the data from the csv files. We
recommend you to take some time to really understand how the datasets work. Some
information might be irrelevant, others might be more important than you can imagine. And
don’t worry about hidden errors or missing values. We assure you that these files are
already cleaned and ready to be used.

Finally, keep your focus on what was asked in the tasks. As already mentioned, you can
come up with new ideas and interpretations of the datasets that can be exposed on the
spreadsheet. This kind of perception will be really appreciated. Just assure you do these
extras after you have already developed what was required beforehand.

Você também pode gostar