Many Word, docx, or doc documents may have similar but not identical content, such as three-digit capital abbreviations, numeric numbers, hours, course codes, and so on. If you open files one by one and manually search for deletion, it is not only time-consuming, but also easy to miss deletion. This article HeSoft Doc Batch Tool As an example, it demonstrates how to use regular expressions to batch fuzzy find keywords in Word and delete matching content in multiple documents at once by replacing them with empty, which is suitable for batch cleaning of regular text in templates, materials, contracts, and reports.
When sorting out a large number of Word documents, we often encounter a situation: the content that needs to be deleted is not a completely fixed word, but a kind of regular text. For example, multiple docx files have three uppercase English abbreviations, numbers, numbers, hours and other content. It is very inefficient to manually open Word one by one and delete it. This article describes how to use office software" HeSoft Doc Batch Tool ", batch fuzzy search through regular expressions, and replace the matching content with empty, so as to realize batch deletion of keywords in Word, docx and doc files.
Applicable scenario: Which Word content is suitable for regular batch deletion
regular expressions are suitable for Word text that is "not identical in content, but regularly formatted. For example:
- batch delete three uppercase English abbreviations, such as BOT, ELA, etc;
- batch delete numbers in documents, such as 60, 2, 1, 3 and other numbers or values;
- batch cleaning of regular marks in course materials, English documents, contract templates and reports;
- handle multiple Word files at once, avoiding repeated opening, finding, deleting, and saving.
In this tutorial example, six docx files need to be processed. The goal is to batch delete keywords that meet the rules in the Word body, rather than manually modifying them one by one.
Effect preview: pre-treatment and post-treatment comparison
before processing, there are multiple Word documents in the folder that need to be cleaned up. Examples include files such as 1.docx to 6.docx.

Open one of the Word documents and you can see that the body contains content that needs to be deleted, such as "BOT" in the title, "ELA" in the body, and the number "60. These contents have certain rules and are suitable for uniform matching with regular expressions.

After processing, the originally matched three uppercase abbreviations and numbers have been deleted. For example, the "BOT" in the title is cleaned up, the relevant abbreviations and numbers in the body are also removed, and the main content of the document is retained.

Steps: Use regular expressions to batch fuzzy delete Word keywords
the following describes the specific operation according to the process in the software interface. HeSoft Doc Batch Tool It is a batch processing software for office documents, suitable for batch processing of Word document content and reducing duplication of labor.
Step 1: Enter the Word tool and select the Find and Replace function
open HeSoft Doc Batch Tool and select Word Tools on the left ". Find and click "Find and replace keywords in Word" in the list of tools ". This function is used to find and replace keywords in Word file content in batch, and can also be deleted in batch by "replace with empty.

The purpose of this step is to enter the function module specially dealing with Word content keyword search and replacement to prepare for the subsequent addition of multiple docx/doc files.
Step 2: Add Word files that need to be batch processed
after entering the function page, in the step of "selecting records to be processed", you can click "add file" or use "import file from folder". If you want to process a large number of Word documents under the same folder, it is more convenient to use folder import.

After the addition is completed, the software will display the file name, path, extension, creation time, modification time and other information in the list. Six docx files have been imported in the screenshot, indicating that these files will participate in subsequent batch processing. After confirming that the document is correct, click "Next".
Step 3: Select "Use formula blur to find text"
on the Set Processing Options page, find Find ". If you only delete fixed words, you can select exact search. In this example, you need to delete the content according to the rules, so select "use formula to find text in fuzzy". The "formula fuzzy lookup" here can be understood as using rules similar to regular expressions to match text.

After selecting this mode, the software can identify the matching content in the Word document in batch according to the input expression, instead of matching only a fixed string.
Step 4: Fill in the regular rules that need to be found.
In the list of keywords to find, fill in a match rule for each row. In the example, two rules are filled in:
- [A-Z]{3}: used to match three consecutive uppercase letters, such as BOT, ELA, etc;
- \d: Used to match consecutive numbers, such as 60, 2, 123, etc.
These two rules can override the abbreviations and numbers that need to be deleted in the sample document. If you need to delete other regular text in your Word document, you can adjust the expression according to the actual situation.
Step 5: Leave the replaced keyword list blank to realize batch deletion
in the Replaced Keyword List area on the right, the message "If it is not filled in, delete" is displayed ". Therefore, this example does not fill in the replacement content, let the software replace the found content with empty. After this process, the text matching [A-Z]{3} and \d will be deleted from the Word document.
After confirming that the rules are correct, click "Next" to enter the subsequent saving location settings. After setting the saving location according to the interface process, enter the "Start Processing" step to perform batch processing.
Frequently Asked Questions and Precautions
- it is recommended to back up the original file before processing: batch deletion is a content modification operation. It is recommended to keep a copy of the original Word file first to avoid affecting the original document after the rules are written incorrectly.
- Regular rules should be as precise as possible: for example, \d will match all consecutive numbers. If you do not want to delete the page number, number and year in the document, you need to use it carefully.
- Note case matching: the Ignore case of letters option is available in the interface. If you only want to remove uppercase abbreviations, usually do not check this option; if you want to match both upper and lower case, then enable it as needed.
- The replacement list is left blank to delete: the key point of this example is to fill in the search list rules. The replaced keyword list does not fill in the content, thus realizing "delete when found".
- For multiple Word files: whether it is a small number of docx files or a large number of doc/docx documents in a folder, you can reduce duplication through batch import.
Summary: Use regular batch to delete Word keywords to reduce repeated modifications
by HeSoft Doc Batch Tool The "Find and Replace Keywords in Word" function of can turn the work that originally needed to open Word documents one by one and find and delete them one by one into one import, one setting and batch processing. For regular content such as three-digit capital abbreviations, number numbers, course codes, etc., use regular expressions for fuzzy search, and then leave the replacement content blank to quickly complete batch deletion.
If you often need to organize a large number of Word, docx or doc documents, it is recommended to give priority to batch processing: first prepare the file backup, then set the matching rules, and finally perform the processing in a unified way. This can significantly reduce duplication of effort and improve the efficiency of office document cleaning.