Skip to content
The Times USA
Menu
  • ABOUT
  • CONTACT
  • LIFESTYLE
  • NATIONAL NEWS
  • BUSINESS
  • INTERNATIONAL NEWS
  • TECHNOLOGY
  • PRICE OF BUSINESS SHOW AUDIOS
Menu

Digging into Data – Unburied Treasure

Posted on June 18, 2020June 19, 2020 by admin

By Elizabeth Thede, Special for The Times USA

Previously, I addressed how a search index like one generated from the program dtSearch® is like a data treasure map, supporting both individual and enterprise-wide concurrent searching with instant hit-highlighted search results. That way, multiple individuals can use the same treasure map at the same time and each arrive at whatever unique X marks the spot that individual is looking for.

But the flip side to this data treasure map is that indexed search can also reveal information that a person who buried it may never have expected would ever come to light. Following are some examples of such buried information that a search engine may uncover. Understanding these examples is an important step to securing your own data.

(1) An end-user might save a file with a filename that doesn’t match the file type. For example, an end-user might mislabel a Microsoft Excel file with a .PDF extension, or mislabel an email archive with a .DOCX extension. However, in parsing a binary format, a search engine like dtSearch will figure out the correct file format specification to apply by looking inside the binary file itself, rather than simply looking at the filename extension. So saving a file with a mismatched filename extension will not have any effect on the ability to uncover text in that file.

(2) An end-user might also nest a file inside another file. An example of that would be an email with a ZIP or RAR attachment and a PDF and Microsoft Word document inside and an Access database fully embedded in the Microsoft Word file. However, a search engine like dtSearch will work its way recursively through such nested attachments, making the inner file contents just as visible as the cover email.

(3) Many “Office”-type applications let you insert obscure metadata where that metadata won’t appear by default when you look at a file in that application. In fact, you may have to click around extensively to find the metadata such that the likelihood of a casual viewer of the file in its native application finding the metadata is very low. However, to a search engine, that metadata is as readily accessible an any other text.

(4) In many applications, an end-user can also hide text by making it the same color as the background underneath the text. As a result, there can be white on white text or black on black text which inside the application itself may not be readily visible. However, to a search engine that text is easily apparent regardless of whether the text color in the application view matches the background color.

(5) An end-user can also slightly misspell a word. Typos in emails are everywhere – at least in my emails. But with fuzzy searching on, dtSearch will automatically sift through those slight typographical errors to find ProJectX when ProJectX is mistyped as ProPectX.

(6) Still another example is credit card numbers that may be buried in files. There is a feature inside of dtSearch that can check if digits presented together may be a valid credit card, even if there is no MasterCard, Visa or American Express insignia, for example.

(7) The final example relates to “image only” PDFs. Have you ever run across a PDF where you try to cut and paste text from it, but you can’t, because it is an image only? A search engine can’t find the text on such a PDF either, because it is an image only. However, a search engine like dtSearch can flag this type of file when it does its indexing, and let you know that you need to run it through an OCR program like Adobe Acrobat. At that point, the text of this file will be buried no more.

dtSearch enterprise and developer products instantly search terabytes of “Office” files, PDFs, emails along with attachments, databases and web-based data. The products can run “on premises” or on online platforms like Azure and AWS. Because dtSearch can instantly search terabytes of data, many customers are large enterprises like Fortune 100 companies and federal, state and international government agencies.

However, in addition to enterprise-level search, dtSearch also lets you search your own documents, emails and the like. Please go to dtSearch.com, and download a fully-functional 30-day evaluation version to instantly search terabytes of your own data.

RELATED: Kevin Price of the Price of Business show discusses the topic with Thede on a recent interview.

You Might Also Like...

  • Data Privacy's Importance to Americans

    By the Price of Business Show, Hosted by Kevin Price.  The Price of Business is a media…

  • Beyond Boolean Search

    By Elizabeth Thede, Special for The Time USA   Many people have heard of Boolean…

  • Bad Data Practice Hurts the Bottom Line

    A new report from Dun & Bradstreet reveals businesses are missing revenue opportunities and losing customers due…

  • Data Management and Implementation Services for RingLead Customers

    DemandGen International, Inc., a world-class team of digital transformation and technology experts, today announced a…

  • Talent Trends Data Shows Strong Global Outlook for the New Year

    C-suite and human capital leaders surveyed around the world continue to feel positive about their…

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Celebrating 25 Years of the Price of Business Show

https://www.youtube.com/watch?v=5ViFPGoK-ks

VIDEO: This Week’s Best of our Network

https://www.youtube.com/watch?v=y3VtH2emP70

GDPR Compliance

USABR does not collect data on its visitors.  For more information visit: https://www.usabusinessradio.com/contact-us/

Contact

Contact articles@usabusinessradio.net for more information on articles on this site. BMuyco@usabusinessradio.net for all other information.

Recent Articles

  • Are There Any Real Business Deals Around $200K?
  • Escaping the Template Trap: Building a Commercial Website with Real Character
  • Making the Most of the Quiet Months: How Consultants Revitalize Schools Over Summer Break
  • Understanding ETFs: Low-Cost Investing for Modern Portfolios
  • Beyond the Tent: Fun and Memorable Activities for Your Next Camping Trip

Also in TTUSA

  • How To Find Parts For Old Slot Machines – Online slot machines 2020
  • Situational Awareness Expert Beth Warford on Becoming a Hard Target
  • Lower standards? The truth behind chemicals allowed in America, but BANNED elsewhere
  • Economic Growth Projected to Slow Down in Asia and the Pacific Rim
  • Media Authority on the Amazon-MGM Merger and Streaming TV

RSS The Daily Blaze

  • The Significance of Scott Pelley’s Firing
  • Artificial Intelligence and Legal Risk: How Businesses Should Structure Contracts for AI Services
  • Surpassing the Storefront: Industries That Depend on Websites to Showcase Their Services
  • Why the “Knights in Shining Armor” Approach Isn’t Solving Legacy Media Problems
  • Trump Censors History at Our National Parks

RSS USA Business Radio

  • The Hidden Business Problems Behind Accounting Challenges
  • Why Continuous Validation Is Replacing Traditional FedRAMP Compliance
  • Exclusive Coverage of the IBBA Conference in Minneapolis
  • How AlmaHolística Bridges the Gap Between Training and Real-World Practice
  • Your Spell Check Will Go Crazy Over “Trillionaire”

RSS USA Daily Times

  • Playing “Beat the Clock” on Your COVID Relief Refund
  • Essential Cybersecurity Practices Every Small Business Should Embrace in 2026: “Cybersecurity in the Age of AI”
  • The Fatty Acid Burn Switch and the Glucose Cycle
  • How Entertainment Franchises Are Reshaping the Snack Aisle
  • Get Organized Day Is April 26. But if We Aren’t Organized Yet, What Are the Chances This Year Will Be Different?

RSS USA Daily Chronicles.

  • Commercial Real Estate Distress: When Workouts Turn Into Litigation
  • H2 — Talking Health and Hypnosis
  • Reclaiming Every Dollar: The Pandemic-Era Interest Freeze
  • The Value Acceleration Journey: How Privately Held Businesses Intentionally Build Enterprise Value
  • Smart Food Choices To Prevent Diabetes

RSS Price of Business

  • RX Pros: Simplifying Access in a Complex Healthcare System
  • Chris Nicholas Vrame and the Work Behind Big Ideas
  • Dr. Emil Kohan: Leading Through Precision, Innovation, and Vision
  • Do You Want To Get Involved in Real Estate?
  • 3 Ways To Choose the Right Energy Solution for Your Business

RSS US Daily Review

  • Families Face Growing Uncertainty Saving for Education
  • DMV Radio Stars DJ Quicksilva and Asia Chandler to Host 18th Annual Miami Takeover Festival
  • 10 Driving Tips to Save Fuel and Reduce Summer Traffic Stress
  • The GDP Shift: Wealthy Dominance Meets Developing Might
  • One Million Views Later: Sarah Mushka Debunks Hasidic Marriage Myths

PoB Digital Network

US Daily Review

USA Business Radio

USA Daily Chronicles

USA Daily Times

The Daily Blaze

The Times USA

Price of Business

Privacy Policy

https://www.thetimesusa.com/privacy-policy-2/

© 2026 The Times USA | Powered by Superbs Personal Blog theme