{"id":451,"date":"2021-05-14T12:56:30","date_gmt":"2021-05-14T12:56:30","guid":{"rendered":"https:\/\/www.lancaster.ac.uk\/stor-i-student-sites\/martin-dimitrov\/?p=451"},"modified":"2021-05-14T15:04:21","modified_gmt":"2021-05-14T15:04:21","slug":"resampling-techniques","status":"publish","type":"post","link":"https:\/\/www.lancaster.ac.uk\/stor-i-student-sites\/martin-dimitrov\/2021\/05\/14\/resampling-techniques\/","title":{"rendered":"Resampling techniques"},"content":{"rendered":"\n<p class=\"wp-block-paragraph\">I<span class=\"has-inline-color has-black-color\">magine that you have a weighted sample of size <span class=\"wp-katex-eq\" data-display=\"false\">N<\/span> and you want to obtain another sample of the same size, consisting of the same elements. One way to do that is by doing a miltinomial sampling, where we sample each point (or particle if we deal with particle filters) proportional to its weight. That way, we will remove particles within the sample that have small weights and resample more the points with higher weights. This, however, reduces the diversity of the sample. What is more, even if all points within the sample have the same weight <span class=\"wp-katex-eq\" data-display=\"false\">1\/N<\/span>,  there is a high probability that some of them will not be resampled at all. That why we devote this blog to sampling techniques.<\/span><\/p>\n\n\n\n<h3 class=\"wp-block-heading\"><span class=\"has-inline-color has-black-color\"><strong>Stratified\/Systematic Resampling<\/strong><\/span><\/h3>\n\n\n\n<p class=\"wp-block-paragraph\"><span class=\"has-inline-color has-black-color\">Imagine that we are trying to find a better resampling technique for the particle filters discussed in the previous blogs. Then at each time observation, we will have <span class=\"wp-katex-eq\" data-display=\"false\">N<\/span> particles and their corresponding weights. Stratified resampling essentially divides the population of particles into sections (subpopulations) called strata, <a rel=\"noreferrer noopener\" href=\"https:\/\/www.researchgate.net\/publication\/274404127_Resampling_Methods_for_Particle_Filtering_Classification_implementation_and_strategies\" data-type=\"URL\" data-id=\"https:\/\/www.researchgate.net\/publication\/274404127_Resampling_Methods_for_Particle_Filtering_Classification_implementation_and_strategies\" target=\"_blank\">[1]<\/a>. To help us visualise the idea, imagine the normalised weights as sections, such that each section has the length of the corresponding weight, see the figure below. For example, the first particle will have a weight equal to length of the first blue rectangle.<\/span><\/p>\n\n\n\n<div class=\"wp-block-image\"><figure class=\"aligncenter size-large\"><img decoding=\"async\" width=\"523\" height=\"85\" src=\"https:\/\/www.lancaster.ac.uk\/stor-i-student-sites\/martin-dimitrov\/wp-content\/uploads\/sites\/31\/2021\/05\/SR.png\" alt=\"\" class=\"wp-image-456\" srcset=\"https:\/\/www.lancaster.ac.uk\/stor-i-student-sites\/martin-dimitrov\/wp-content\/uploads\/sites\/31\/2021\/05\/SR.png 523w, https:\/\/www.lancaster.ac.uk\/stor-i-student-sites\/martin-dimitrov\/wp-content\/uploads\/sites\/31\/2021\/05\/SR-300x49.png 300w\" sizes=\"(max-width: 523px) 100vw, 523px\" \/><figcaption><span class=\"has-inline-color has-black-color\"><strong>Partitioning the [0.1] interval<\/strong><\/span><\/figcaption><\/figure><\/div>\n\n\n\n<p class=\"wp-block-paragraph\"><span class=\"has-inline-color has-black-color\">Now, since the sections are equal to the normalised weights, their total length should add up to <span class=\"wp-katex-eq\" data-display=\"false\">1<\/span>. In the figure, we have considered <span class=\"wp-katex-eq\" data-display=\"false\">5<\/span> particles, but you can imagine a lot more, we have just plotted the strata for the first <span class=\"wp-katex-eq\" data-display=\"false\">5<\/span>. The bottom rectangle represents the interval <span class=\"wp-katex-eq\" data-display=\"false\">[0,1]<\/span>, but split into equal sections of length <span class=\"wp-katex-eq\" data-display=\"false\">1\/N<\/span>, where <span class=\"wp-katex-eq\" data-display=\"false\">N<\/span> is the number of particles. Then, the stratified resampling will sample a uniform random point in each of the small rectangles on the bottom and see which particle it corresponds to. Naturally, this is eliminate the particles with very small weights, but will even out to some extent their sampling. For example, if all the particles had the same weight, then in the multinomial resampling there is a high probability that some of them will not be resampled, which is not the case for the Stratified resampling, ensuring the diversity of the particles is not deteriorating. Mathematically, Stratified resampling partitions the <span class=\"wp-katex-eq\" data-display=\"false\">[0,1]<\/span> interval into <span class=\"wp-katex-eq\" data-display=\"false\">N<\/span> disjoint subintervals: <span class=\"wp-katex-eq\" data-display=\"false\">[0, 1\/N) \\cup [1\/N, 2\/N) \\cup \\ldots \\cup [1-1\/N, 1]<\/span>. Then, sample from <span class=\"wp-katex-eq\" data-display=\"false\">N<\/span> random variables, <span class=\"wp-katex-eq\" data-display=\"false\">\\{u_{j}^{i}\\}_{i=1}^{N}<\/span> independently in each interval at time <span class=\"wp-katex-eq\" data-display=\"false\">j<\/span>, i.e.:<\/span><\/p>\n\n\n\n<p class=\"has-text-align-center wp-block-paragraph\"><span class=\"has-inline-color has-black-color\"><span class=\"wp-katex-eq\" data-display=\"false\">u_{j}^{i} \\sim U\\Big[\\frac{i-1}{N}, \\frac{i}{N}\\Big), \\quad \\text{for} \\; i \\in {1,\\ldots,N}<\/span>.<\/span><\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><span class=\"has-inline-color has-black-color\">Then we can obtain the resampled particles following the argument above. On the other hand, <strong>systematic resampling<\/strong>, see <a rel=\"noreferrer noopener\" href=\"https:\/\/digital-library.theiet.org\/content\/journals\/10.1049\/ip-rsn_19990255\" data-type=\"URL\" data-id=\"https:\/\/digital-library.theiet.org\/content\/journals\/10.1049\/ip-rsn_19990255\" target=\"_blank\">[2]<\/a>, exploits the strata in a different way, where the only the first random variable is drawn from a uniform distribution, while the others are determined deterministically by the following procedure:<\/span><\/p>\n\n\n\n<p class=\"has-text-align-center wp-block-paragraph\"><span class=\"has-inline-color has-black-color\"><span class=\"wp-katex-eq\" data-display=\"false\">u_{j}^{1} \\sim U\\Big[0, \\frac{1}{N}\\Big),<\/span><\/span><\/p>\n\n\n\n<p class=\"has-text-align-center wp-block-paragraph\"><span class=\"has-inline-color has-black-color\"><span class=\"wp-katex-eq\" data-display=\"false\">u_{j}^{i} = u_{j}^{1} + \\frac{i - 1}{N}, \\quad \\text{for} \\; i \\in {1,\\ldots,N}<\/span><\/span><\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><span class=\"has-inline-color has-black-color\">Now, both stratified and systematic resampling turn out to be of <span class=\"wp-katex-eq\" data-display=\"false\">\\mathcal{O}(N)<\/span>, but the systematic resampling is more computationally efficient as it requires smaller number of random variables to be generated.<\/span><\/p>\n\n\n\n<h3 class=\"wp-block-heading\"><strong><span class=\"has-inline-color has-black-color\">Residual Resampling<\/span><\/strong><\/h3>\n\n\n\n<p class=\"wp-block-paragraph\"><span class=\"has-inline-color has-black-color\">Residual resampling consists of two stages, see <a rel=\"noreferrer noopener\" href=\"https:\/\/ieeexplore.ieee.org\/abstract\/document\/570818\" data-type=\"URL\" data-id=\"https:\/\/ieeexplore.ieee.org\/abstract\/document\/570818\" target=\"_blank\">[3]<\/a>, <a rel=\"noreferrer noopener\" href=\"https:\/\/www.tandfonline.com\/doi\/abs\/10.1080\/01621459.1998.10473765\" data-type=\"URL\" data-id=\"https:\/\/www.tandfonline.com\/doi\/abs\/10.1080\/01621459.1998.10473765\" target=\"_blank\">[4]<\/a> and <a rel=\"noreferrer noopener\" href=\"https:\/\/ieeexplore.ieee.org\/abstract\/document\/4378824\" data-type=\"URL\" data-id=\"https:\/\/ieeexplore.ieee.org\/abstract\/document\/4378824\" target=\"_blank\">[5]<\/a>. The first stage of the algorithm is to determine which particles have a weight higher than <span class=\"wp-katex-eq\" data-display=\"false\">1\/N<\/span>. Then, it will sample each such particle <span class=\"wp-katex-eq\" data-display=\"false\">N_{j}^{i}<\/span> times, where <span class=\"wp-katex-eq\" data-display=\"false\">N_{j}^{i} = floor(Nw_{j}^{i})<\/span>. The total number of the replicated particles after the first stage will be <span class=\"wp-katex-eq\" data-display=\"false\">N_{j} = \\sum_{i=1}^{N}N_{j}^{i}<\/span>. The second stage consists of replicating <span class=\"wp-katex-eq\" data-display=\"false\">R_{j} = N - N_{j}<\/span> number of particles, using a multinomial sampling, but from new (residual) weights, given by (unnormalised):<\/span><\/p>\n\n\n\n<p class=\"has-text-align-center wp-block-paragraph\"><span class=\"has-inline-color has-black-color\"><span class=\"wp-katex-eq\" data-display=\"false\">\\hat{w}_{j}^{i} = w_{j}^{i} - \\frac{N_{j}^{i}}{N}<\/span>.<\/span><\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><span class=\"has-inline-color has-black-color\">Intuitively, if a particle has been resampled at the first stage, then its residual weight will be reduced, leading to a lower probability of it being resampled again. If the particle has not been resampled in the first stage, then its residual weight the maximum one for this particular particle (in other words the original), hence it has a higher change of being resampled at the second stage. This ensures particle diversity while at the same time removing the particles with negligible weights. Due to the two loop structure, the time complexity of the residual resampling is <span class=\"wp-katex-eq\" data-display=\"false\">\\mathcal{O}(N) + \\mathcal{O}(R_{j})<\/span>. However, one can remove the computatinally expensive multinomial resampling by conbining the two loops into one. The method is called <em>residual systematic resampling <\/em>(RSR). In essence, RSR is very similar to systematic resampling. The difference lies in that systematic resampling the random variables <span class=\"wp-katex-eq\" data-display=\"false\">{u_{j}^{i}}_{i = 1}^{N}<\/span> are determined once we know <span class=\"wp-katex-eq\" data-display=\"false\">u_{j}^{1}<\/span>, while in RSR, <span class=\"wp-katex-eq\" data-display=\"false\">u_{j}^{i}<\/span> is updated with reference to the currently considered weight, and for this reason the value of the weight of the previous particle needs to be subtracted from <span class=\"wp-katex-eq\" data-display=\"false\">u_{j}^{i}<\/span>.<\/span><\/p>\n\n\n\n<h3 class=\"wp-block-heading\"><span class=\"has-inline-color has-black-color\"><strong>Varying sample size<\/strong><\/span><\/h3>\n\n\n\n<p class=\"wp-block-paragraph\"><span class=\"has-inline-color has-black-color\">Up to know we have considered different resampling methods where at each time observation we have a choice which results in either propagating the same paths (if the ESS is above some threshold) or resampling the same number of paths that we started with. There could be instances, however, where it could be useful sample more or less particles. Imagine for example, that from one time observation to the next, your ESS deteriorates drastically . This means that we are not exploring the space so much and resampling more particles could be desirable. On the contrary, we also saw that sampling too many particles might only lead to incremental improvements, while linearly increasing the computational time, hence it might be desirable sometimes to sample less particles. One approach, called <em><a rel=\"noreferrer noopener\" href=\"https:\/\/www.sciencedirect.com\/science\/article\/pii\/S0167278906003204\" data-type=\"URL\" data-id=\"https:\/\/www.sciencedirect.com\/science\/article\/pii\/S0167278906003204\" target=\"_blank\">branch-kill<\/a><\/em> or <em><a rel=\"noreferrer noopener\" href=\"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s004400050249.pdf\" data-type=\"URL\" data-id=\"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s004400050249.pdf\" target=\"_blank\">branching<\/a><\/em>, replicates the <span class=\"wp-katex-eq\" data-display=\"false\">i^{th}<\/span> particle at time <span class=\"wp-katex-eq\" data-display=\"false\">j<\/span>, <span class=\"wp-katex-eq\" data-display=\"false\">N_{j}^{i} = floor(Nw_{j}^{i}) + 1<\/span> times with probability <span class=\"wp-katex-eq\" data-display=\"false\">p<\/span>, or <span class=\"wp-katex-eq\" data-display=\"false\">N_{j}^{i} = floor(Nw_{j}^{i})<\/span> with probability <span class=\"wp-katex-eq\" data-display=\"false\">1-p<\/span>, where <span class=\"wp-katex-eq\" data-display=\"false\">p = Nw_{j}^{i} - floor(Nw_{j}^{i})<\/span>. <\/span><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Imagine that you have a weighted sample of size and you want to obtain another sample of the same size, consisting of the same elements. One way to do that is by doing a miltinomial sampling, where we sample each point (or particle if we deal with particle filters) proportional to its weight. That way,&hellip; <br \/> <a class=\"read-more\" href=\"https:\/\/www.lancaster.ac.uk\/stor-i-student-sites\/martin-dimitrov\/2021\/05\/14\/resampling-techniques\/\">Read more<\/a><\/p>\n","protected":false},"author":32,"featured_media":452,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"class_list":["post-451","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-uncategorized"],"_links":{"self":[{"href":"https:\/\/www.lancaster.ac.uk\/stor-i-student-sites\/martin-dimitrov\/wp-json\/wp\/v2\/posts\/451","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.lancaster.ac.uk\/stor-i-student-sites\/martin-dimitrov\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.lancaster.ac.uk\/stor-i-student-sites\/martin-dimitrov\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.lancaster.ac.uk\/stor-i-student-sites\/martin-dimitrov\/wp-json\/wp\/v2\/users\/32"}],"replies":[{"embeddable":true,"href":"https:\/\/www.lancaster.ac.uk\/stor-i-student-sites\/martin-dimitrov\/wp-json\/wp\/v2\/comments?post=451"}],"version-history":[{"count":12,"href":"https:\/\/www.lancaster.ac.uk\/stor-i-student-sites\/martin-dimitrov\/wp-json\/wp\/v2\/posts\/451\/revisions"}],"predecessor-version":[{"id":471,"href":"https:\/\/www.lancaster.ac.uk\/stor-i-student-sites\/martin-dimitrov\/wp-json\/wp\/v2\/posts\/451\/revisions\/471"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.lancaster.ac.uk\/stor-i-student-sites\/martin-dimitrov\/wp-json\/wp\/v2\/media\/452"}],"wp:attachment":[{"href":"https:\/\/www.lancaster.ac.uk\/stor-i-student-sites\/martin-dimitrov\/wp-json\/wp\/v2\/media?parent=451"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.lancaster.ac.uk\/stor-i-student-sites\/martin-dimitrov\/wp-json\/wp\/v2\/categories?post=451"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.lancaster.ac.uk\/stor-i-student-sites\/martin-dimitrov\/wp-json\/wp\/v2\/tags?post=451"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}