FAQ: Why Does My .docx File Look Like Junk In Notepad++

    PeterJones
    Posted Aug 17, 2018, 1:40 PM; last edited by PeterJones Feb 6, 2024, 5:29 PM

    Hello, and welcome to the FAQ Desk. If you find yourself linked to this post, it is likely because you asked something like “Why does my .docx file look like junk [in Notepad++]?”, or “Why doesn’t Find In Files match anything when searching a directory of PDFs?”

    Notepad++ is an editor for plaintext files. The file types .docx (modern MS Word files) and .pdf (Adobe’s Portable Document Format files) are binary (i.e., not plaintext) files, which can hold text, formatting, images, embedded objects, links, etc. Here, “binary” means they are encoded in such a way that the sequence of bytes in the file (without additional decoding) does not necessarily match any plaintext representation (like ASCII, ISO 8859-*, or UTF-8 Unicode), and is thus unintelligible to Notepad++. The fact that Notepad++ renders any of the text from the document as readable plain text, or that its Find in Files feature discovers any matches in those file types, is the exception rather than the rule.

    (There are times, especially in PDFs, when a file does contain some plain text… but there is no guarantee that the sequence you are looking for will stay intact as plaintext; it might be split up by binary bytes, or have binary control characters embedded in it, which defeats your search. Even here, finding what you expect is the exception rather than the rule.)
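To make the point above concrete, here is a minimal Python sketch. It relies on one fact not spelled out in this post: a .docx file is a ZIP archive whose text lives in a compressed XML part named `word/document.xml`. The archive built below is only a stand-in shaped like a .docx (Word would not open it), and the function names are hypothetical, chosen for illustration. It compares searching the raw bytes, which is roughly what a plaintext search does, with searching the decoded content.

```python
import io
import zipfile


def build_docx_like_archive(text: str) -> bytes:
    """Build a tiny ZIP archive shaped like a .docx (illustrative only)."""
    paragraph = f"<w:p><w:r><w:t>{text}</w:t></w:r></w:p>"
    xml = f"<w:document><w:body>{paragraph * 3}</w:body></w:document>"
    buffer = io.BytesIO()
    # ZIP_DEFLATED compresses the XML, as real .docx files normally do.
    with zipfile.ZipFile(buffer, "w", compression=zipfile.ZIP_DEFLATED) as archive:
        archive.writestr("word/document.xml", xml)
    return buffer.getvalue()


def raw_bytes_contain(data: bytes, needle: str) -> bool:
    """Search the undecoded bytes, as a plaintext tool would."""
    return needle.encode("utf-8") in data


def decoded_text_contains(data: bytes, needle: str) -> bool:
    """Decompress the XML part first, then search it."""
    with zipfile.ZipFile(io.BytesIO(data)) as archive:
        xml = archive.read("word/document.xml").decode("utf-8")
    return needle in xml


if __name__ == "__main__":
    data = build_docx_like_archive("quarterly report")
    print("raw search:    ", raw_bytes_contain(data, "quarterly report"))
    print("decoded search:", decoded_text_contains(data, "quarterly report"))
    # The part name is stored uncompressed in the ZIP headers, so this
    # fragment of "plain text" does show up in the raw bytes.
    print("part name raw: ", raw_bytes_contain(data, "word/document.xml"))
```

The part name is the kind of stray plaintext the parenthetical describes: it happens to be stored uncompressed, while the document text itself is not visible until the file is decoded.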

    Notepad++ was not built to read such binary files. If you want to read or search .docx files, you need a program (usually a word processor, such as MS Word, LibreOffice, OpenOffice, or the like) that is specifically designed to read them; similarly, for .pdf files, you need Adobe Acrobat Reader or another PDF viewer or editor. (The reason “acrobat search … finds many” is that Acrobat is designed to read and search .pdf files.)

    What you are asking is the equivalent of “I just brought my friend, who only reads English, over to index my personal library: Why is she not able to index my Russian, Hindi, and ancient Greek books?” That friend could reasonably be expected to understand British English, American English, Canadian English, and Australian English (in my analogy, various standard encodings of the same underlying text, such as ASCII, UTF-8, …), but it is unreasonable to expect her to also understand shorthand Sanskrit (in my analogy, a compressed binary format with its own format-specific encoding).
