<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0"><channel><title><![CDATA[Find corrupt files]]></title><description><![CDATA[<p dir="auto">hello. I want to find in all my 4.000 text files those that are corrupt, such as this one:</p>
<p dir="auto"><a href="https://pastebin.com/3xHYckZE" rel="nofollow ugc">https://pastebin.com/3xHYckZE</a>  or here  a print screen <a href="https://snipboard.io/1NrDaT.jpg" rel="nofollow ugc">https://snipboard.io/1NrDaT.jpg</a></p>
<p dir="auto">I try to search for some ascii, but without good result.</p>
]]></description><link>https://community.notepad-plus-plus.org/topic/18148/find-corrupt-files</link><generator>RSS for Node</generator><lastBuildDate>Mon, 15 Jun 2026 10:23:50 GMT</lastBuildDate><atom:link href="https://community.notepad-plus-plus.org/topic/18148.rss" rel="self" type="application/rss+xml"/><pubDate>Mon, 26 Aug 2019 15:01:44 GMT</pubDate><ttl>60</ttl><item><title><![CDATA[Reply to Find corrupt files on Mon, 26 Aug 2019 16:02:42 GMT]]></title><description><![CDATA[<p dir="auto"><a class="plugin-mentions-user plugin-mentions-a" href="/user/vasile-caraus" aria-label="Profile: Vasile-Caraus">@<bdi>Vasile-Caraus</bdi></a> said:</p>
<blockquote>
<p dir="auto">I want to find in all my 4.000 text files those that are corrupt</p>
</blockquote>
<p dir="auto">Assuming you knew what text should be in a good file, which you could guarantee was in a corrupt file, you could use the Notepad++ find-in-files feature to look for files that did have that – but that would find the non-corrupt files, not the corrupt ones.</p>
<p dir="auto">Assuming you knew that all your good files were pure ascii, and the corrupt ones all guaranteed to have something outside of the normal ASCII range (ie, in code points 128-255), you could find-all-files for something like <code>[\x80-\xFF]</code>.</p>
<p dir="auto">If your “good” files are in a full 8-byte encoding (a 1-page European or Asian encoding), or if your good files use a unicode encoding like UTF-8, it would be very difficult to identify “corrupt” vs “good”, because multi-byte encodings will use most or all 8-bit bytes as valid <em>somewhere</em> in the sequence.</p>
<p dir="auto">If you need more help than that, you’ll have to provide more information, like defining what you mean by corrupt, and whether there are certain similarities between all good files or between all corrupt files.</p>
]]></description><link>https://community.notepad-plus-plus.org/post/46593</link><guid isPermaLink="true">https://community.notepad-plus-plus.org/post/46593</guid><dc:creator><![CDATA[PeterJones]]></dc:creator><pubDate>Mon, 26 Aug 2019 16:02:42 GMT</pubDate></item></channel></rss>