1 00:00:04,459 --> 00:00:20,519 Formative cycle, higher degree, marketing and advertising, professional module, commercial research, work unit 7, treatment and statistical analysis of the data, associated with the learning result 7. 2 00:00:24,739 --> 00:00:29,719 Objectives. Learn to calculate the basic measures of descriptive statistics. 3 00:00:29,719 --> 00:00:46,159 index 1 calculation of the average 2 calculation of the median in this video we are going to learn to calculate 4 00:00:46,159 --> 00:00:51,560 the average with simple data we see that I have a series of data and I want to calculate the average 5 00:00:51,560 --> 00:00:57,200 excel has the average function that automatically calculates the average for me I put the same average I 6 00:00:57,200 --> 00:01:04,340 open parentheses I select all the data I close parentheses I give it to intro and I already have the 7 00:01:04,340 --> 00:01:09,719 average for simple data. If we remember what the average is, it is the sum of all 8 00:01:09,719 --> 00:01:14,239 the data divided by the total number of data. We could check that this one 9 00:01:14,239 --> 00:01:22,299 has done it correctly. If we add all the data here with self-sum, I put here, I give it 10 00:01:22,299 --> 00:01:29,719 self-sum and automatically I get 193 and to calculate it is equal to the sum of 11 00:01:29,719 --> 00:01:35,599 all the data 193 divided by the total number of data which is 18 and we have 12 00:01:35,599 --> 00:01:41,780 checked that the calculated average comes out perfectly as the definition we are going to see 13 00:01:41,780 --> 00:01:48,079 now the calculation of the average in grouped data we want to obtain information about the level 14 00:01:48,079 --> 00:01:55,400 of daily consumption of a certain product from the response given by 730 people 15 00:01:55,400 --> 00:02:01,280 that have been surveyed in several commercial centers the accumulated data obtained that 16 00:02:01,280 --> 00:02:07,239 are distributed by intervals in terms of daily weight consumption has been the one that 17 00:02:07,239 --> 00:02:12,979 is reflected in the table where we have daily consumption in interval and absolute frequency we see 18 00:02:12,979 --> 00:02:18,319 that I do not have the data as such for that reason I have to calculate the class mark which is x 19 00:02:18,319 --> 00:02:24,500 sub and which is the middle point of the interval how are we going to calculate it because to calculate it we have 20 00:02:24,500 --> 00:02:29,539 to add the ends of the interval in this case it is 1 plus 5 and the sum we 21 00:02:29,539 --> 00:02:35,120 divide by 2 and it would give us 3 and so with all the intervals in this case 6 22 00:02:35,120 --> 00:02:41,879 plus 10 divided by 2 gives us 8 then we already have the class mark the formula 23 00:02:41,879 --> 00:02:45,719 for the average calculation is the sum of the class mark that we already have 24 00:02:45,719 --> 00:02:52,180 by the frequency we have made a table here with that calculation of the 25 00:02:52,180 --> 00:02:59,680 class mark by the frequency and the add-on because we have done the self-sum gives us 10,705 divided 26 00:02:59,680 --> 00:03:05,979 by the total number of surveyed people, which in this case is the total absolute frequency, 27 00:03:05,979 --> 00:03:14,740 which is 730 and already the average because it gives me 15 and I would already have calculated the average for grouped data 28 00:03:16,400 --> 00:03:27,419 we are going to calculate now the median with simple data the median is the value that the 29 00:03:27,419 --> 00:03:31,719 central place of all the data when these are ordered to calculate it 30 00:03:31,719 --> 00:03:37,560 excel establishes the function of the median equal to median we open parentheses 31 00:03:37,560 --> 00:03:41,919 and select all the data that in this case are odd data I close 32 00:03:41,919 --> 00:03:46,539 parentheses I give it to intro and I have already calculated the median value for 33 00:03:46,539 --> 00:03:52,080 odd data we are going to calculate we are going to check that that value is correct and 34 00:03:52,080 --> 00:04:00,479 following the definition we are going to order the data of greater and we check that the central position 35 00:04:00,479 --> 00:04:07,180 is 10 therefore it has calculated the data of the median correctly with the function equal to 36 00:04:07,180 --> 00:04:12,900 the median I have 9 data above and 9 data below then the median is the value that 37 00:04:12,900 --> 00:04:20,759 occupies the central place and this is correct but what happens when the data is even when I have 38 00:04:20,759 --> 00:04:28,480 one more data, for example, we are going to add one more data so that they are even in this case we are going to 39 00:04:28,480 --> 00:04:35,519 calculate the median equal to the median I open parentheses and select all the even data I close 40 00:04:35,519 --> 00:04:41,160 parentheses and give it to intro it has returned me the value of the median which is 9 we are going to check 41 00:04:41,160 --> 00:04:50,129 if the function has done it correctly by ordering the data from minor to major the data 42 00:04:50,129 --> 00:04:58,050 that are in the central position are the data of position 10 and 11 in this case what we have 43 00:04:58,050 --> 00:05:04,709 to do is talk we do the average of the values ​​that are in the center in this case 44 00:05:04,709 --> 00:05:13,949 are 8 and 10 to calculate the average we divide between 2 and the average would be 9 with which it has 45 00:05:13,949 --> 00:05:21,629 correctly calculated the data of the median for even data and the median calculation would be done. 46 00:05:21,629 --> 00:05:29,790 We are now going to calculate the median in grouped data. In the same previous example in which 47 00:05:29,790 --> 00:05:34,629 we had calculated the median, we are going to calculate the median in the grouped data table. 48 00:05:34,629 --> 00:05:41,290 We are going to calculate the median using this formula that we had already explained in the previous video. 49 00:05:41,290 --> 00:05:47,610 When we have ungrouped data, the median is the value that is in the middle when we have 50 00:05:47,610 --> 00:05:53,889 ordered the data from lower to higher or from higher to lower value. The problem here is that we do not have 51 00:05:53,889 --> 00:06:00,129 the data as such, we have an absolute frequency table and we cannot calculate it that way. 52 00:06:00,129 --> 00:06:07,149 For this, what we are going to do is build a table of accumulated frequencies. To this column 53 00:06:07,149 --> 00:06:15,110 we call it F capital, accumulated frequency. The first value is the value of 100, the second 54 00:06:15,110 --> 00:06:23,889 value would be the first value plus the second value and so on. The third value is the sum 55 00:06:23,889 --> 00:06:31,089 of the three values ​​of the frequencies and so on. The final number we obtain is the 56 00:06:31,089 --> 00:06:36,170 total number of the data of the table as we can check now well we are going to 57 00:06:36,170 --> 00:06:44,550 calculate n between 2 in this case in between 2 is equal to 730 between 2 that 58 00:06:44,550 --> 00:06:49,269 gives us a value of 365 we go to the column of 59 00:06:49,269 --> 00:06:53,970 accumulated frequencies and in this case we look for the first value that is 60 00:06:53,970 --> 00:07:04,490 greater than 365 which is 434 in 250 it is not greater than 365 so we pass this value therefore 61 00:07:04,490 --> 00:07:09,629 once we have identified the accumulated frequency we have also identified the 62 00:07:09,629 --> 00:07:17,490 value of the interval of the median which would be between 11 and 15 therefore l and is the first 63 00:07:17,490 --> 00:07:23,769 number of the interval that we have just found which would be 11 the capital F we have to 64 00:07:23,769 --> 00:07:29,329 remember that they refer to the accumulated frequencies and the f minuscule to the absolute frequency 65 00:07:29,329 --> 00:07:35,649 it is important that we distinguish this to then substitute in the formula therefore the f 66 00:07:35,649 --> 00:07:45,110 sub i minus 1 is the accumulated frequency of the previous interval, that is, 250 and the is the 67 00:07:45,110 --> 00:07:51,310 amplitude of the interval in this case is calculated subtracting the limits of the interval we would have 68 00:07:51,310 --> 00:07:57,009 have to subtract 15 minus 11 and it would be equal to 4 and now we already have all the data 69 00:07:57,009 --> 00:08:02,170 we need what we are going to do is simply substitute them in the formula so 70 00:08:02,170 --> 00:08:09,730 that the median would be equal to l sub 0 which would be 11 plus in between 2 which would be 365 minus 71 00:08:09,730 --> 00:08:19,870 f sub and minus 1 which would be 250 between f minuscule sub and which would be 184 and that gives us a value 72 00:08:19,870 --> 00:08:24,730 of the average of 13.5 and with this we would have done the calculation of the 73 00:08:24,730 --> 00:08:31,569 average in grouped data activity defines and calculates the average and the 74 00:08:31,569 --> 00:08:35,909 average from the following statistics table 75 00:08:35,909 --> 00:08:40,230 activity calculates the average and the average from the 76 00:08:40,230 --> 00:08:44,549 following statistics table daily consumption of milk and absolute frequency with 77 00:08:44,549 --> 00:08:47,750 the following data