{"id":8893,"date":"2019-07-01T06:00:52","date_gmt":"2019-07-01T06:00:52","guid":{"rendered":"http:\/\/sonicfrog.net\/?p=8893"},"modified":"2019-07-01T06:12:16","modified_gmt":"2019-07-01T06:12:16","slug":"the-perils-of-bad-data-and-bad-data-interpretation","status":"publish","type":"post","link":"https:\/\/sonicfrog.net\/?p=8893","title":{"rendered":"The Perils Of Bad Data And Bad Data Interpretation"},"content":{"rendered":"<div style=\"padding-bottom:20px; padding-top:10px;\" class=\"hupso-share-buttons\"><!-- Hupso Share Buttons - https:\/\/www.hupso.com\/share\/ --><a class=\"hupso_toolbar\" href=\"https:\/\/www.hupso.com\/share\/\"><img src=\"https:\/\/static.hupso.com\/share\/buttons\/share-medium.png\" style=\"border:0px; padding-top: 5px; float:left;\" alt=\"Share Button\"\/><\/a><script type=\"text\/javascript\">var hupso_services_t=new Array(\"Twitter\",\"Facebook\",\"Google Plus\",\"Pinterest\",\"Linkedin\",\"StumbleUpon\",\"Digg\",\"Reddit\",\"Bebo\",\"Delicious\");var hupso_background_t=\"#EAF4FF\";var hupso_border_t=\"#66CCFF\";var hupso_toolbar_size_t=\"medium\";var hupso_image_folder_url = \"\";var hupso_url_t=\"\";var hupso_title_t=\"The%20Perils%20Of%20Bad%20Data%20And%20Bad%20Data%20Interpretation\";<\/script><script type=\"text\/javascript\" src=\"https:\/\/static.hupso.com\/share\/js\/share_toolbar.js\"><\/script><!-- Hupso Share Buttons --><\/div>\n<p>A friend on Facebook posted a write-up <a href=\"https:\/\/www.americanthinker.com\/blog\/2019\/01\/texas_finds_95000_noncitizen_registrations_and_58000_illegal_votes_imagine_california.html?fbclid=IwAR1rnLVnhBXpN811hLw_xu7g96rqC5rYzdE4qZyhFY8WIiWN65QLGaL2u4U#.XRQBlj5uE1U.facebook\">in the American Thinker<\/a> about a report issued by the Texas Secretary of State earlier this year showing 58,000 illegals voted in Texas elections between 1996 and 2015. The entire thing was <a href=\"https:\/\/www.latimes.com\/politics\/la-na-pol-texas-voting-lawsuit-settlement-david-whitley-20190427-story.html?fbclid=IwAR24ep-FoDnC5c_yO7J7jz6N9KakX479tD0ENioxAqyPsx0Kym-GCIaJX7k\">completely discredited in court<\/a> due to bad methodologies. After the study was scrutinized, the number of non-citizens that were supposed to have been found voting in Texas elections went from 58,000 to about 80. My friend later posted a much better study on the topic. The better study provides\u00a0talking points on both sides of the political divide, that some non-citizens do vote in US elections, but on the other hand, the amount that do is quite small. Here&#8217;s the conclusion of the study.<\/p>\n\n\n\n<blockquote class=\"wp-block-quote\"><p>&#8220;&#8221;&#8221;Our exploration of non-citizen voting in the 2008 presidential election found that most non-citizens did not register or vote in 2008, but some did. The proportion of noncitizens who voted was less than fifteen percent, but<br>significantly greater than zero. Similarly in 2010 we found<br>that more than three percent of non-citizens reported<br>voting.<\/p><p>These results speak to both sides of the debate concerning non-citizen enfranchisement. They support the<br>claims made by some anti-immigration organizations<br>that non-citizens participate in U.S. elections. In addition,<br>the analysis suggests that non-citizens&#8217; votes have<br>changed significant election outcomes including the<br>assignment of North Carolina&#8217;s 2008 electoral votes, and<br>the pivotal Minnesota Senate victory of Democrat Al<br>Franken in 2008.<\/p><p>However, our results also support the arguments made<br>by voting and immigrant rights organizations that the<br>portion of non-citizen immigrants who participate in U.S.<br>elections is quite small. Indeed, given the extraordinary<br>efforts made by the Obama and McCain campaigns to<br>mobilize voters in 2008, the relatively small portion of noncitizens who voted in 2008 likely exceeded the portion of<br>non-citizens voting in other recent U.S. elections.&#8221;&#8221;&#8221;<\/p><\/blockquote>\n\n\n\n<p>The study above relies heavily on data from two studies by Stephen Ansolabehere (<a href=\"https:\/\/cces.gov.harvard.edu\/publications\/constituents-responses-congressional-roll-call-voting\">2010<\/a>, <a href=\"https:\/\/cces.gov.harvard.edu\/publications\/re-examining-validity-different-survey-modes-measuring-public-opinion-us-findings-\">2011<\/a>). The author of that paper coauthored a paper pointing to severe flaws in the way Richman, Chattha, and Earnest used the data. The original studies and the data provided were not designed to be interpreted to look at this question (this is one example of &#8220;<a href=\"https:\/\/theness.com\/neurologicablog\/index.php\/p-hacking-and-other-statistical-sins\/\">P-hackking<\/a>&#8221; ) . As Ansolabehere <a href=\"https:\/\/cces.gov.harvard.edu\/news\/perils-cherry-picking-low-frequency-events-large-sample-surveys\">states in a rebuttal<\/a>:<\/p>\n\n\n\n<blockquote class=\"wp-block-quote\"><p>&#8220;&#8221;&#8221;Suppose a survey question is asked of 20,000 respondents, and that, of these persons, 19,500 have a given characteristic (e.g., are citizens) and 500 do not. Suppose that 99.9 percent of the time the survey question identifies correctly whether people have a given characteristic, and 0.1 percent of the time respondents who have a given characteristic incorrectly state that they do not have that characteristic. (That is, they check the wrong box by mistake.) That means, 99.9 percent of the time the question correctly classifies an individual as having a characteristic\u2014such as being a citizen of the United States\u2014and 0.1 percent of the time it classifies someone as not having a characteristic, when in fact they do. This rate of misclassification or measurement error is extremely low and would be tolerated by any survey researcher. It implies, however, that one expects 19 people out of 20,000 to be incorrectly classified as not having a given characteristic, when in fact they do.<\/p><p>Normally, this is not a problem. In the typical survey of 1,000 to 2,000 persons, such a low level of measurement error would have no detectable effect on the sample. Even in very large sample surveys, survey practitioners expect a very low level of measurement error would have effects that wash out between two categories. The non-citizen voting example highlights a potential pitfall with very large databases in the study of low frequency categories. Continuing with the example of citizenship and voting, the problem is that the citizen group is very large compared to the non-citizen group in the survey. So even if the classification is extremely reliable, a small classification error rate will cause the bigger category to influence analysis of the low frequency category is substantial ways. Misclassification of 0.1 percent of 19,500 respondents leads us to expect that 19 respondents who are citizens will be classified as non-citizens and 1 non-citizen will be classified as a citizen. (This is a statistical expectation\u2014the actual numbers will vary slightly.) The one non-citizen classified as a citizen will have trivial effects on any analyses of the overall pool of people categorized as citizens, as that individual will be 1 of 19,481 respondents. However, the 19 citizens incorrectly classified as non-citizens can have significant effects on analyses, as they are 3.7 percent (19 of 519) of respondents who said they are non-citizens.<\/p><p>Such misclassifications can explain completely the observed low rate of a behavior, such as voting, among a relatively rare or low-frequency group, such as non-citizens. Suppose that 70 percent of those with a given characteristic (e.g., citizens) engage in a behavior (e.g., voting). Suppose, further, that none of the people without the characteristic (e.g., non-citizens) are allowed to engage in the behavior in question (e.g., vote in federal elections). Based on these suppositions, of the 19 misclassified people, we expect 13 (70%) to be incorrectly determined to be non-citizen voters while 0 correctly classified non-citizens would be voters. Hence, a 0.1 percent rate of misclassification\u2014a very low level of measurement error\u2014would lead researchers to expect to observe that 13 of 519 (2.8 percent) people classified as non-citizens voted in the election, when those results are due entirely to measurement error, and no non-citizens actually voted.<\/p><p>This example parallels the reliability and vote rates in the CCES 2010-2012 panel survey. From this we conclude that measurement error almost certainly explains the observed voting rate among self-identified non-citizens in the CCES\u2014as reported by Richman and his colleagues. &#8220;&#8221;&#8221;<\/p><\/blockquote>\n\n\n\n<p>When I was Conservative, I used to support the idea of voter ID to ensure illegals were not voting and stealing elections. I changed my mind because no one could ever produce evidence that that kind of voter fraud was happening at any rate that justified the <a href=\"https:\/\/fivethirtyeight.com\/features\/what-we-know-about-voter-id-laws\/\">possible disenfranchisement<\/a> of legal voters.  A <a href=\"https:\/\/www.nber.org\/papers\/w25522\">recent study<\/a> suggests that voter ID laws don&#8217;t seem to cause much disenfranchisement. And they also don&#8217;t do much to stop voter fraud either. Of course, Conservative press only reported the results they liked, that voter ID laws <a href=\"https:\/\/www.washingtontimes.com\/news\/2019\/feb\/13\/voter-id-laws-dont-depress-turnout-despite-democra\/\">doesn&#8217;t seem to lead to detectable disenfranchisement<\/a>. But they don&#8217;t mention that there doesn&#8217;t seem to be any detectable fraud either. Unfortunately this paper is behind a paywall, but I&#8217;ll provide a link in case anyone wants to fork out the dough to buy it. This is what the abstract reports:<\/p>\n\n\n\n<blockquote class=\"wp-block-quote\"><p>U.S. states increasingly require identification to vote \u2013 an ostensive attempt to deter fraud that prompts complaints of selective disenfranchisement. Using a difference-in-differences design on a 1.3-billion-observations panel, we find the laws have no negative effect on registration or turnout, overall or for any group defined by race, gender, age, or party affiliation. These results hold through a large number of specifications and cannot be attributed to mobilization against the laws, measured by campaign contributions and self-reported political engagement. ID requirements have no effect on fraud either \u2013 actual or perceived. Overall, our results suggest that efforts to reform voter ID laws may not have much impact on elections.<\/p><\/blockquote>\n\n\n\n<p>So there seems to be two lessons here. First: when you post things to support your political position, try to make sure your supporting data is accurate and says what you want. Second: If you want to make an argument to support legislation to correct a problem, make sure there is a real problem to be solved. It still looks like voter ID is a solution waiting for a problem. <\/p>\n","protected":false},"excerpt":{"rendered":"<div style=\"padding-bottom:20px; padding-top:10px;\" class=\"hupso-share-buttons\"><!-- Hupso Share Buttons - https:\/\/www.hupso.com\/share\/ --><a class=\"hupso_toolbar\" href=\"https:\/\/www.hupso.com\/share\/\"><img src=\"https:\/\/static.hupso.com\/share\/buttons\/share-medium.png\" style=\"border:0px; padding-top: 5px; float:left;\" alt=\"Share Button\"\/><\/a><script type=\"text\/javascript\">var hupso_services_t=new Array(\"Twitter\",\"Facebook\",\"Google Plus\",\"Pinterest\",\"Linkedin\",\"StumbleUpon\",\"Digg\",\"Reddit\",\"Bebo\",\"Delicious\");var hupso_background_t=\"#EAF4FF\";var hupso_border_t=\"#66CCFF\";var hupso_toolbar_size_t=\"medium\";var hupso_image_folder_url = \"\";var hupso_url_t=\"\";var hupso_title_t=\"The%20Perils%20Of%20Bad%20Data%20And%20Bad%20Data%20Interpretation\";<\/script><script type=\"text\/javascript\" src=\"https:\/\/static.hupso.com\/share\/js\/share_toolbar.js\"><\/script><!-- Hupso Share Buttons --><\/div><p>A friend on Facebook posted a write-up in the American Thinker about a report issued by the Texas Secretary of State earlier this year showing 58,000 illegals voted in Texas elections between 1996 and 2015. The entire thing was completely discredited in court due to bad methodologies. After the study was scrutinized, the number of [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":[],"categories":[56,27,22],"tags":[],"_links":{"self":[{"href":"https:\/\/sonicfrog.net\/index.php?rest_route=\/wp\/v2\/posts\/8893"}],"collection":[{"href":"https:\/\/sonicfrog.net\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/sonicfrog.net\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/sonicfrog.net\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/sonicfrog.net\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=8893"}],"version-history":[{"count":4,"href":"https:\/\/sonicfrog.net\/index.php?rest_route=\/wp\/v2\/posts\/8893\/revisions"}],"predecessor-version":[{"id":8897,"href":"https:\/\/sonicfrog.net\/index.php?rest_route=\/wp\/v2\/posts\/8893\/revisions\/8897"}],"wp:attachment":[{"href":"https:\/\/sonicfrog.net\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=8893"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/sonicfrog.net\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=8893"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/sonicfrog.net\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=8893"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}