{"id":153,"date":"2020-01-28T12:42:39","date_gmt":"2020-01-28T12:42:39","guid":{"rendered":"http:\/\/www.lancaster.ac.uk\/stor-i-student-sites\/tessa-wilkie\/?p=153"},"modified":"2020-05-01T14:13:37","modified_gmt":"2020-05-01T14:13:37","slug":"extreme-value-theory-predicting-the-ultra-rare","status":"publish","type":"post","link":"https:\/\/www.lancaster.ac.uk\/stor-i-student-sites\/tessa-wilkie\/2020\/01\/28\/extreme-value-theory-predicting-the-ultra-rare\/","title":{"rendered":"Extreme value theory: predicting the ultra rare"},"content":{"rendered":"\t\t<div data-elementor-type=\"wp-post\" data-elementor-id=\"153\" class=\"elementor elementor-153\">\n\t\t\t\t\t\t<section class=\"elementor-section elementor-top-section elementor-element elementor-element-4148b5b3 elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"4148b5b3\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-1ae23930\" data-id=\"1ae23930\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-2745b6f1 elementor-widget elementor-widget-text-editor\" data-id=\"2745b6f1\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p><\/p>\n<p>Extreme value theory is a really exciting \u2014 and kind of astonishing \u2014 area of statistics. This is because it can tell us about the probability of events happening that are so rare there is barely any data recorded on them.<\/p>\n<p><\/p>\n<p>This seems perverse. Very broadly, traditional statistics says that we may not able to make accurate predictions about what may happen on an individual level (for example, how tall one puppy may grow to be in adulthood). But, if we look at a large population (the development of large numbers of puppies) we can get an idea of the range that we expect the majority to be in.<\/p>\n<p><\/p>\n<p>With extreme value theory, we are not interested in the behaviour of the majority. We want to look at the likelihood of a very, very rare event happening. Such as a Dachshund puppy that grows to be bigger than a Doberman.<\/p>\n<figure id=\"attachment_161\" aria-describedby=\"caption-attachment-161\" style=\"width: 300px\" class=\"wp-caption alignright\"><img fetchpriority=\"high\" decoding=\"async\" class=\"size-medium wp-image-161\" src=\"http:\/\/www.lancaster.ac.uk\/stor-i-student-sites\/tessa-wilkie\/wp-content\/uploads\/sites\/14\/2020\/01\/Dach_pup_USE-300x199.jpg\" alt=\"\" width=\"300\" height=\"199\" srcset=\"https:\/\/www.lancaster.ac.uk\/stor-i-student-sites\/tessa-wilkie\/wp-content\/uploads\/sites\/14\/2020\/01\/Dach_pup_USE-300x199.jpg 300w, https:\/\/www.lancaster.ac.uk\/stor-i-student-sites\/tessa-wilkie\/wp-content\/uploads\/sites\/14\/2020\/01\/Dach_pup_USE.jpg 425w\" sizes=\"(max-width: 300px) 100vw, 300px\" \/><figcaption id=\"caption-attachment-161\" class=\"wp-caption-text\">A Dachsund puppy<\/figcaption><\/figure>\n\n<p>\u00a0<\/p>\n<p>Why would we want to know that? Well, let\u2019s say you own a VW Beetle and you want to buy a Dachshund puppy. Your family is fiercely attached to the car and will only agree to getting a puppy if it means you will not need to sell the car.<\/p>\n<p><\/p>\n<p>You are pretty sure this will not happen. You promise them this could never happen. But then you start to worry: <em>could <\/em>the puppy grow to be too big to fit in the car? You\u2019ve never heard of \u2014 or seen \u2014 a Dachshund that\u2019s too big for a Beetle. But does that mean you can be certain?<\/p>\n<p><\/p>\n<p>The trouble with extreme events \u2014 from a statistical point of view \u2014 is that they do not happen very often, if at all. We might want to know the probability of a once in 1000 years type event. We do not have a large body of data that can give us steer on what and when these events might occur.<\/p>\n<p><\/p>\n<p>So, are we stuck?<\/p>\n<p><\/p>\n<p>No! Thanks to extreme value theory. \u00a0<\/p>\n<p><\/p>\n<p>Statisticians can focus on the tails of the data \u2014 meaning they can examine the events that have a very low probability of occurring. They usually do this in one of two ways in the univariate setting.*<\/p>\n<p><\/p>\n<p>We can look at maxima over a certain period of time. For example, we could group Dachshunds according to the year they were born, then record the tallest in each year group. Surprisingly (to me) these tend to a distribution (the Generalised Extreme Value distribution).<\/p>\n<p><\/p>\n<p>This is Very Good News in statistics. It means we have mathematically backed insight into the way the population of maxima behaves.<\/p>\n<p><\/p>\n<p>What if we had two Dachshunds born in 2015 that grew very big? If we were looking for maxima we would only count the largest one, so we would be cutting out a potentially useful bit of data. A method that gets around this issue is to look at exceedances \u2014 data points that come above a certain threshold.<\/p>\n<figure id=\"attachment_160\" aria-describedby=\"caption-attachment-160\" style=\"width: 300px\" class=\"wp-caption alignleft\"><img decoding=\"async\" class=\"size-medium wp-image-160\" src=\"http:\/\/www.lancaster.ac.uk\/stor-i-student-sites\/tessa-wilkie\/wp-content\/uploads\/sites\/14\/2020\/01\/Dachshund_diff_sizes-300x148.png\" alt=\"\" width=\"300\" height=\"148\" srcset=\"https:\/\/www.lancaster.ac.uk\/stor-i-student-sites\/tessa-wilkie\/wp-content\/uploads\/sites\/14\/2020\/01\/Dachshund_diff_sizes-300x148.png 300w, https:\/\/www.lancaster.ac.uk\/stor-i-student-sites\/tessa-wilkie\/wp-content\/uploads\/sites\/14\/2020\/01\/Dachshund_diff_sizes.png 662w\" sizes=\"(max-width: 300px) 100vw, 300px\" \/><figcaption id=\"caption-attachment-160\" class=\"wp-caption-text\">An unusually large puppy, with adult to scale<\/figcaption><\/figure>\n\n<p>\u00a0<\/p>\n<p>If we decide that any Dachshund taller than, say, 40cm is remarkable then we can look at the distribution of Dachshunds that exceed that level. This would give us data that accords to a Generalised Pareto Distribution.<\/p>\n<p><\/p>\n<p>One of the big academic issues here is choosing that threshold level: set it too high and you don\u2019t get much data. Set it too low and you are out of the tails of the distribution.\u00a0<\/p>\n<p><\/p>\n<p>These theories have important applications \u2014 beyond prospective dog owners with families that love their car <s>a little too much<\/s>.<\/p>\n<p><\/p>\n<p>Flood defences are one area where governments need to know what a really, really bad flood would look like and how to protect people from it. But, because flood defences are expensive, they also don\u2019t want to build ones that are bigger than necessary.<\/p>\n<p><\/p>\n<p>Finance is another area. How likely is an extreme financial or economic shock? What measures should be in place to ensure that institutions, and the financial system itself, can withstand it? Regulators would want to make sure they are not insisting on such strongly risk-averse measures that it is impossible for companies to make a profit.<\/p>\n<p><\/p>\n<p>*By univariate, I mean we are looking at just one variable. For example: height of dog, or observed temperatures or daily rainfall. We are not looking at several variables together (the multivariate setting).<\/p>\n<p>Want to know more? There is a whole journal dedicated to Extreme Value Theory. <a href=\"https:\/\/link.springer.com\/journal\/10687\/volumes-and-issues\">Here&#8217;s a link to past volumes and issues.\u00a0<\/a><\/p>\n<p><\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<\/div>\n\t\t","protected":false},"excerpt":{"rendered":"<p>Extreme value theory is a really exciting \u2014 and kind of astonishing \u2014 area of statistics. This is because it can tell us about the probability of events happening that are so rare there is barely any data recorded on them. This seems perverse. Very broadly, traditional statistics says that we may not able to make accurate predictions about what may happen on an individual level (for example, how tall one puppy may grow to be in adulthood). But, if we look at a large population (the development of large numbers of puppies) we can get an idea of the range that we expect the majority to be in. With extreme value theory, we are not interested in the behaviour of the majority. We want to look at the likelihood of a very, very rare event happening. Such as a Dachshund puppy that grows to be bigger than a Doberman. \u00a0 Why would we want to know that? Well, let\u2019s say you own a VW Beetle and you want to buy a Dachshund puppy. Your family is fiercely attached to the car and will only agree to getting a puppy if it means you will not need to sell the car. You are pretty sure this will not happen. You promise them this could never happen. But then you start to worry: could the puppy grow to be too big to fit in the car? You\u2019ve never heard of \u2014 or seen \u2014 a Dachshund that\u2019s too big for a Beetle. But does that mean you can be certain? The trouble with extreme events \u2014 from a statistical point of view \u2014 is that they do not happen very often, if at all. We might want to know the probability of a once in 1000 years type event. We do not have a large body of data that can give us steer on what and when these events might occur. So, are we stuck? No! Thanks to extreme value theory. \u00a0 Statisticians can focus on the tails of the data \u2014 meaning they can examine the events that have a very low probability of occurring. They usually do this in one of two ways in the univariate setting.* We can look at maxima over a certain period of time. For example, we could group Dachshunds according to the year they were born, then record the tallest in each year group. Surprisingly (to me) these tend to a distribution (the Generalised Extreme Value distribution). This is Very Good News in statistics. It means we have mathematically backed insight into the way the population of maxima behaves. What if we had two Dachshunds born in 2015 that grew very big? If we were looking for maxima we would only count the largest one, so we would be cutting out a potentially useful bit of data. A method that gets around this issue is to look at exceedances \u2014 data points that come above a certain threshold. \u00a0 If we decide that any Dachshund taller than, say, 40cm is remarkable then we can look at the distribution of Dachshunds that exceed that level. This would give us data that accords to a Generalised Pareto Distribution. One of the big academic issues here is choosing that threshold level: set it too high and you don\u2019t get much data. Set it too low and you are out of the tails of the distribution.\u00a0 These theories have important applications \u2014 beyond prospective dog owners with families that love their car a little too much. Flood defences are one area where governments need to know what a really, really bad flood would look like and how to protect people from it. But, because flood defences are expensive, they also don\u2019t want to build ones that are bigger than necessary. Finance is another area. How likely is an extreme financial or economic shock? What measures should be in place to ensure that institutions, and the financial system itself, can withstand it? Regulators would want to make sure they are not insisting on such strongly risk-averse measures that it is impossible for companies to make a profit. *By univariate, I mean we are looking at just one variable. For example: height of dog, or observed temperatures or daily rainfall. We are not looking at several variables together (the multivariate setting). Want to know more? There is a whole journal dedicated to Extreme Value Theory. Here&#8217;s a link to past volumes and issues.\u00a0<\/p>\n","protected":false},"author":8,"featured_media":164,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[4],"tags":[],"class_list":["post-153","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-statistics"],"_links":{"self":[{"href":"https:\/\/www.lancaster.ac.uk\/stor-i-student-sites\/tessa-wilkie\/wp-json\/wp\/v2\/posts\/153","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.lancaster.ac.uk\/stor-i-student-sites\/tessa-wilkie\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.lancaster.ac.uk\/stor-i-student-sites\/tessa-wilkie\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.lancaster.ac.uk\/stor-i-student-sites\/tessa-wilkie\/wp-json\/wp\/v2\/users\/8"}],"replies":[{"embeddable":true,"href":"https:\/\/www.lancaster.ac.uk\/stor-i-student-sites\/tessa-wilkie\/wp-json\/wp\/v2\/comments?post=153"}],"version-history":[{"count":12,"href":"https:\/\/www.lancaster.ac.uk\/stor-i-student-sites\/tessa-wilkie\/wp-json\/wp\/v2\/posts\/153\/revisions"}],"predecessor-version":[{"id":301,"href":"https:\/\/www.lancaster.ac.uk\/stor-i-student-sites\/tessa-wilkie\/wp-json\/wp\/v2\/posts\/153\/revisions\/301"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.lancaster.ac.uk\/stor-i-student-sites\/tessa-wilkie\/wp-json\/wp\/v2\/media\/164"}],"wp:attachment":[{"href":"https:\/\/www.lancaster.ac.uk\/stor-i-student-sites\/tessa-wilkie\/wp-json\/wp\/v2\/media?parent=153"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.lancaster.ac.uk\/stor-i-student-sites\/tessa-wilkie\/wp-json\/wp\/v2\/categories?post=153"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.lancaster.ac.uk\/stor-i-student-sites\/tessa-wilkie\/wp-json\/wp\/v2\/tags?post=153"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}