Practice Excel Assignment Walkthrough

The following instructions are intended to guide INSS 300 students through the Practice Excel Assignment. These instructions are focused solely on this assignment, and not designed to be a comprehensive introduction to all of Excel’s features. They assume the use of an Excel version that comes with Microsoft Office 365 ProPlus, available to all University of Baltimore students under the Office 365 license. With other versions of Excel, menus and options may work differently from what has been described here.

Remember to periodically save your changes.

Initial Steps

o   From the posted assignment, right-click the data file ORDERS_Truncated.xlsx, click ‘Save Link As…’ and save it to your computer with the name <yourfirstname_<yourlastname>_Practice_Excel_Assignment.xlsx. For example, Rajesh_Mirani_Practice_Excel_Assignment.xlsx. Make a note of the drive and directory pertaining to the saved location.

o   In Windows, display the drive and directory corresponding to your saved location, and double-click the name of your saved data file so that it opens in Excel, as follows.



o   Your Excel spreadsheet currently comprises of a single worksheet labeled ‘Sheet1.’ Scroll up and down as well as left and right in Sheet1 to obtain a sense of its contents. As described in the assignment, this worksheet comprises of 4069 rows, including a header row, and 22 columns A through V. Note how the contents of each column relate to the column’s definition in the assignment. You may fast-scroll to various row and column extremities by using keyboard combinations such as CTRL+HOME, CTRL+END, CTRL+LEFT ARROW, CTRL+RIGHT ARROW, CTRL+UP ARROW, AND CTRL+DOWN ARROW.

Data Preparation

Add the following new columns to the right end of the Excel spreadsheet, and populate them with computed data by applying appropriate formulas, based on the following information.

 

             i.   NET_REVENUE in Column W, defined as: (Extended Price in column O times Shipped Quantity in column S) minus Return Revenue in column V.


To accomplish step (i), scroll over to the cell W1, type NET_REVENUE into that cell, and hit ENTER.



Note that the width of column W is less than the width of the text you just typed into cell W1. You may automatically widen column W just enough to accommodate your typed text by hovering your cursor at the boundary between the headers of column W and column X such that the cursor’s shape changes to a double-headed arrow. Keeping the cursor in place with its new shape, double-click the mouse. Note how the width of column W increases just enough to accommodate all currently typed text in that column.

 

 

 

 

 

 

 

Now highlight column W with a single click on the W header. Then click the HOME tab in the menu across the top, and finally click the $ symbol located in the top center of the screen to format all values to be entered in this column as dollars.

 

Next, place your cursor in cell W2, type the formula “=(O2*Q2)-V2” without the quotes, and hit ENTER.

 

 

 

 

 

 

 

Finally, copy the formula you just entered in cell W2 to cells W3 through W4069. To do so, click cell W2 once, then position the cursor on the dot in the lower right-hand corner of that cell such that the cursor’s shape changes to a ‘plus’ symbol. Keeping the cursor in place with its new shape, double-click the mouse to automatically copy the formula contained in cell W2 into cells W3 through W4069.


 

 

 

 

 



You may click the function key F2 in each cell to note how the copied formulas now reference their respective rows. To exit a formula and return to the computed cell value, press the Esc key.




 

            ii.   NET_PROFIT in Column X, defined as: (Shipped Quantity in column S minus Returned Quantity in column U) times (Extended Price in column O minus Extended Cost in column N).


Apply your learning from step (i) to accomplish this step.

 

           iii.   SALE_YEAR in Column Y, defined as: The four digit year contained in the Order date in column M.

The formula you will need to enter in cell Y2 is “=YEAR(M2)” without the quotes.

Apply your learning from step (i) to accomplish other parts of this step.


          iv.   PERCENT_RETURNS in Column Z, defined as: Returned Quantity in column U divided by Shipped Quantity in column S, but only if Shipped Quantity is greater than zero. (If Shipped Quantity is zero, the value in Column Z is to be entered as zero also.)

Excel’s built-in logical function IF is the best approach for putting together a formula here. The formula you will need to enter in cell Z2 is “=IF(S2<>0,U2/S2,0)” without the quotes. Its interpretation is as follows: If the value in cell S2 is not equal to zero, then compute Z2 as U2 divided by S2, else compute Z2 as zero.

Apply your learning from step (i) to accomplish other parts of this step.


           v.   NET_QUANTITY in Column AA, defined as: Shipped Quantity in column S minus Returned Quantity in column U.

 

Apply your learning from step (i) to accomplish this step.

 

          vi.   PRICE_CATEGORY in Column AB, defined as: ‘1’ if Extended Price in column O is less than $100, ‘2’ if Extended Price is at least $100 but less than $200, ‘3’ if Extended Price is at least $200 but less than $300, and ‘4’ if Extended Price is greater than $300.

Excel’s built-in function VLOOKUP is the best approach for putting together a formula here. VLOOKUP, or vertical lookup, searches an ascending list in the first column of a user-provided table, looking for a particular value. It starts its search in the top row of the first column, moving the search vertically down one row at a time. Upon finding the very first exact match or a higher value, it anchors itself to that row, and now looks horizontally across that matching row in order to retrieve the value of the cell contents of the second or third column, as requested.

 

To accomplish step (vi), therefore, we will create and define the following table in a new worksheet in order to apply VLOOKUP to it:

 

EXTENDED_PRICE

PRICE_CATEGORY

0

1

100

2

200

3

300

4



To do so, first click the ‘New Sheet’ button (i.e., the + symbol) next to the Sheet1 tab at the bottom of the screen.

 

 

In the resulting new Sheet2 worksheet, type this user-defined table starting at location A1, as follows:

 

 

 

Next, double-click the Sheet2 tab and change its label by typing the word PRICE_TABLE in its place. Hit ENTER when done.

 

 

Now click the Sheet1 tab to return to your original data. The formula you will now need to enter in cell AB2 is “=VLOOKUP(O2,PRICE_TABLE!$A$1:$B$5,2)” without the quotes.

 

In the above formula, O2 represents the price looked up.

PRICE_TABLE!$A$1:$B$5 represents the location of the user-defined lookup table. The $ symbols denote an absolute, fixed location for the lookup table.

The number 2 before the close of the parenthesis represents the column number whose cell contents are to be retrieved.

 

 

Apply your learning from step (i) to accomplish other parts of this step.

In summary, the six newly defined columns should appear as follows:

 

 

 

 

Analysis

1.    Copy the original sheet (‘Sheet1’) containing data in all of your columns A through AB into a new sheet called ‘Filter1. Then apply a filter to the copied data in this new sheet in order to answer the following question: Which year had the highest canceled quantity in a single order? Provide the pertinent details.

To start off this step, right-click the Sheet1 tab at the bottom, select ‘Move or Copy…’ and then finally place a check in the box labeled ‘Create a copy’ before clicking the OK button.  


This results in the creation of a new worksheet called ‘Sheet1 (2),’ containing a copy of the original data. Double-click the new tab and change its label from ‘Sheet1 (2)’ to ‘Filter1.’

We are now ready to answer Question 1.

In the Filter1 sheet, click the DATA tab in the menu across the top, then click the Filter symbol located near the top center of the screen. This should result in drop down arrows appearing on the right edges of each column header, as follows:


Click the drop down arrow next to the CANCEL_QUANTITY header in column H. From the resulting drop down box, unselect (i.e., uncheck) all items in the displayed list of all column H values, and then reselect only the highest value, i.e., 5. Finally click OK.


Note the lone order that appears on the screen containing this value of 5 under CANCEL_QUANTITY.


Using the information from the above analysis, answer Question 1, providing not only the year asked for (i.e., 2008), but also other pertinent details that support your answer, e.g., customer number, order number, and order date.

2.    Now consider ALL orders in the year you found in Question 1 above. Compare them collectively with orders in other years. Provide two meaningful observations about this particular year from your comparative analysis.

It is possible to answer this question in any number of ways, but we will demonstrate below how to compare the year 2008 with other years, by comparing total annual net profits in each year.

First, copy the original data in Sheet1 to another new sheet that we will call ANNUAL_PROFITS, using the method described under Question 1.

In this new ANNUAL_PROFITS sheet, click any cell containing data, then press CTRL+A on the keyboard to highlight all data in the sheet. Next, click
the DATA tab in the menu across the top, and then click the Sort symbol located to the left of the Filter symbol near the top center. This brings up the following screen:




Now click the arrow in the ‘Sort by’ box, select SALE_YEAR, and click OK:





The above operation results in a sorting of the rows in the ANNUAL_PROFITS sheet, by the SALE_YEAR field. Once again, press CTRL+A on the keyboard to highlight all data, and click Sort to bring up the Sort dialog box. After visually confirming that the SALE_YEAR sort entry from the previous step is showing, click the Add Level button:



Select NET_PROFIT as the second level sorting field and click OK:





Observe how the results of these two steps have resulted in a two-layered sorting of the rows in the ANNUAL_PROFITS sheet. All rows have first been sorted in ascending order by SALE_YEAR, and, within each year, by NET_PROFIT:




We will now insert yearly subtotal rows by NET_PROFIT into this sheet. To do so, click
the DATA tab in the menu across the top, then click the Subtotal menu option near the top right of the screen:




This brings up the Subtotal dialog box. In this box, select SALE_YEAR under ‘At each change in.’ Next, check the NET_PROFIT field under ‘Add subtotal to,’ and uncheck any other checked fields there. Finally make sure that there is a check in the ‘Summary below data’ check box, and click OK.




The above operation results in the insertion of new rows containing yearly subtotals for NET_PROFIT, after all the order records for each year. Scroll down the SALE_YEAR and NET_PROFIT columns to view these inserted rows. The following is a snapshot of the summary row for the year 2004.


You may conduct other analyses similar to the one above, by substituting NET_PROFIT with other measures of business performance such as NET_REVENUE. Using aggregate comparisons of the year 2008 with other years from all these analyses, answer Question 2, providing pertinent details as necessary to support your answer.

3.    Copy the original sheet (‘Sheet1’) containing data in all of your columns A through AB into a new sheet called ‘Filter2.’ Then apply a filter to the copied data in this new sheet in order to answer the following question: Which payment methods were used for orders with the highest net profit (top 1%)? Provide the pertinent details.

In order to answer this question, first copy the original data in Sheet1 to another new sheet labeled Filter2, using the method described under Question 1.

Following this, in the Filter2 sheet, click the Filter symbol to reveal the
drop down arrows on the right edges of each column header, also using the method in Question 1.

Next, click the drop down arrow next to the NET_PROFIT header in column X. In the resulting drop down box, click ‘Number Filters,’ and then ‘Top 10…,’:




In the resulting ‘Top 10 AutoFilter’ dialog box, select ‘Top,’ ‘1,’ and ‘Percent’ respectively, and click OK:




The above operation filters the view to only those rows containing the top 1% of values of NET_PROFIT, resulting in a display of only 40 of the 4068 records. You may now answer Question 3 by clicking the drop down arrow next to the PAY_METHOD header in column P and noting the values listed there. Provide any other pertinent details as necessary to support your answer.




4.    Copy the original sheet (‘Sheet1’) containing data in all of your columns A through AB into a new sheet called ‘Highlight.’ Then apply conditional formatting to the copied data in this new sheet in order to highlight orders made by division 05 where net profit was in the bottom 25% of all orders.

In order to answer this question, first copy the original data in Sheet1 to another new sheet labeled Highlight, using the method described under Question 1.

Next, click the cell X1 containing the column title NET_PROFIT, then press CTRL+A on the keyboard to highlight all data in the sheet. Next, click
the HOME tab in the menu across the top, and then click the Conditional Formatting symbol, followed by ‘Top/Bottom Rules,’ and ‘More Rules…’




This opens the ‘New Formatting Rule’ dialog box. In the top half of this dialog box, select ‘Use a formula to determine which cells to format.’ In the bottom half, enter the formula =AND($X1<=PERCENTILE($X$2:$X$4069,0.25),$K1="05") into the text box labeled ‘Format values where this formula is true:’ and then click the Format button.


In the above formula, $X1 represents a relative reference to successive cells in column X, starting with the first cell X2 in the range X2 through X4069. The ‘1’ in $X1 denotes the first cell of that range.

0.25 represents the bottom 25th percentile.

Similarly, $K1 represents a relative reference to successive cells in column K starting with row 2.

The logical operator AND combines the two conditions that govern the formatting rule.





In the resulting ‘Format Cells’ dialog box, select the color and font that you would like for the highlighted cells. Then click OK in the two open dialog boxes to complete the conditional formatting process.





The worksheet ‘Highlight’ should now highlight orders at division 05 where net profit was in the bottom 25% of all orders:



Create pivot charts to answer questions 5 through 8.

5.    In how many distinct years during the period 2005-2011 did product category E generate the highest total net revenue, compared to other product categories? Which years were they, and what were the total net revenues for product category E in those years?

To create this PivotChart, first return to Sheet1 containing the original data. Click any cell containing data, then click CTRL+A on the keyboard to highlight all data in Sheet1. Next, click the INSERT tab in the menu across the tab, then click PivotChart, and PivotChart once again.




In the resulting Create PivotChart dialog box, make sure that the Table/Range text box displays Sheet1!$A$1:$AB$4069 and that ‘New Worksheet’ has been selected underneath. Then click OK.




Note that Excel has created a new sheet called Sheet2 and placed an empty PivotChart in it. Next, you will specify the contents of this PivotChart, by dragging and dropping field names contained in the list under ‘PivotChart Fields’ on the right of the screen, into the four white spaces underneath respectively titled FILTERS, LEGEND, AXIS, and VALUES.




Before proceeding further, double-click the Sheet2 tab and rename it ‘Question 5.’

To answer Question 5, first drag NET_REVENUE from the list into the VALUES space, as follows.



Upon dropping NET_REVENUE into the VALUES space, it will show as ‘Sum of NET_REVENUE.’ Next, drag and drop SALE_YEAR into the AXIS space, and PRODUCT_CATEGORY_ID into the LEGEND space:




Our next step will be to restrict the years displayed on the X axis to 2005 through 2011. To do so, click the SALE_YEAR button located at the bottom left of the column chart, and in the resulting dialog box remove the checks corresponding to 2004 and 2012.




Clicking the OK button changes the chart to the following:




Now drag any of the four handles at the corners of this chart to enlarge it, so that it displays all product categories A through X:




You may now answer Question 5 by comparing the heights of the columns in this chart. Provide all pertinent details as necessary to support your answer.

6.    How does the answer to Question 5 above change if each of the four price categories in column AB are analyzed separately?

To answer Question 6, repeat the PivotChart creation process described in Question 5, saving your new PivotChart to a new sheet called ‘Question 6.’ After dragging and dropping NET_REVENUE, SALE_YEAR, and PRODUCT_CATEGORY_ID into the respective spaces, also drag and drop PRICE_CATEGORY into FILTERS:




Next, click the PRICE_CATEGORY button located at the top left of the column chart, and in the resulting dialog box select ‘1’ and click OK to display results pertaining only to price category 1 (i.e., products under $100):




Note how the heights of the various columns in the chart have changed as a result of filtering the view to sales of products under $100 only. Make your observations from this chart in order to answer Question 6:




Repeat the filtering process described above three more times, selecting price categories 2, 3, and 4 respectively in these other iterations. When done, answer Question 6 based on all of your observations from the four filtering iterations. Provide all pertinent details as necessary to support your answer.


7.    Which specific combination of division ID and sales channel had the highest average percent returns, compared to all other combinations of division IDs and sales channels (e.g., Web sales in division 01, or, mail order sales in division 05)? What was the percent value of this highest average return rate?

To answer Question 7, repeat the PivotChart creation process described in Question 5, saving your new PivotChart to a new sheet called ‘Question 7.’ Drag and drop both DIVISION_ID and CHANNEL into the AXIS space, and PERCENT_RETURNS into the VALUES space:




Since Question 7 asks about average PERCENT_RETURNS rather than total or sum of PERCENT_RETURNS, our next step is to change the display from ‘Sum of PERCENT_RETURNS’ to ‘Average of PERCENT_RETURNS.’ To do so, click ‘Sum of PERCENT-RETURNS’ in the VALUES space, then click ‘Value Field Settings…’ in the drop down box that appears. In the resulting ‘Value Field Settings’ dialog box, change the selection from Sum to Average and click OK:


Your chart should now change to the following display of ‘Average of PERCENT_RETURNS’:




You may now answer Question 7 by comparing the heights of the columns in this chart. Provide all pertinent details as necessary to support your answer.

 

8.    Create a bar or column chart of sum of net quantity on the vertical axis against price category on the horizontal axis. Do you observe a distinct pattern? Suggest a possible business or consumer rationale for this pattern.

To answer Question 8, repeat the PivotChart creation process described in Question 5, saving your new PivotChart to a new sheet called ‘Question 8.’ Drag and drop PRICE_CATEGORY into the AXIS space, and NET QUANTITY into the VALUES space. The resulting chart will appear as follows:




You may now answer Question 8 by observing a trend pertaining to changes in the heights of the columns in this chart. Provide all pertinent details as necessary to support your answer. Offer a possible rationale oriented to consumer spending habits in order to answer this question properly.

After completing this practice assignment, you may now attempt the actual Excel assignment.