Both data recovery tools are portable programs that run as soon as they are unpacked, with no installation needed. Their interfaces look similar: each offers options to load a damaged file (docx in one tool, xlsx in the other) and to save the result. The program analyzes the selected document and tries to recover its text and values, then displays the recovered data in its interface. The data can be edited right there or saved as a txt or csv file for further editing in programs better suited to the task.
Damaged docx2txt
This is a GUI version of the great docx2txt Perl script by Sandeep Kumar. It extracts text from damaged or corrupted Word 2007 files in cases where Word 2007 itself fails.
Word 2007 files are actually zipped collections of XML files, and XML as a format is unforgiving of data corruption. The main text of a Word 2007 docx file is stored in the document.xml file in that collection. Damaged docx2txt uses CakeCMD, an unzipper that can unzip partially corrupt document.xml files. In addition, the Perl routine that extracts the text from document.xml does not care whether the XML is well-formed, which is a possible stumbling block for Word 2007.
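To illustrate the idea of pulling text out of document.xml without requiring well-formed XML, here is a minimal sketch in Python. It is not the Perl routine the tool uses; the function name extract_text and the sample string are made up for illustration, and it assumes the usual w: prefix on the WordprocessingML tags.

```python
import html
import re


def extract_text(xml: str) -> str:
    """Recover plain text from (possibly truncated) document.xml content.

    No XML parser is involved, so a missing closing tag or a file that
    is cut off mid-element does not abort the extraction.
    """
    # Paragraph ends become newlines; tabs and line breaks are mapped too.
    xml = re.sub(r"</w:p>", "\n", xml)
    xml = re.sub(r"<w:tab\s*/>", "\t", xml)
    xml = re.sub(r"<w:br\s*/>", "\n", xml)
    # Drop every complete tag, then a trailing tag cut off by truncation.
    xml = re.sub(r"<[^>]*>", "", xml)
    xml = re.sub(r"<[^>]*$", "", xml)
    # Turn entities such as &amp; back into plain characters.
    return html.unescape(xml)


# Hypothetical damaged input: the file ends in the middle of a tag.
damaged = (
    "<w:document><w:body>"
    "<w:p><w:r><w:t>Hello &amp; welcome</w:t></w:r></w:p>"
    "<w:p><w:r><w:t>Second line</w:t></w:r></w:p>"
    "<w:p><w:r><w:t>Cut off</w:t></w:r><w:p"
)
recovered = extract_text(damaged)
print(recovered)
```

A strict XML parser would reject this input outright, while the pattern-based approach keeps everything that is still readable.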
http://www.godskingsandheroes.info/software/dd2txt-0.52.zip
Corrupt xlsx2csv
Corrupt xlsx2csv is a new freeware GUI program for salvaging the data from corrupt Excel 2007 files. Excel 2007 xlsx files are really zipped collections of XML files. The main raw data is contained in the sharedStrings XML file and the numbered worksheet XML files. XML is a very unforgiving medium when it comes to data corruption, so if the sharedStrings and/or worksheet XML files become corrupt, Excel has difficulty recovering the unformatted data.
Corrupt xlsx2csv uses a command line unzipping program that can unzip partially corrupt worksheet[#].xml and sharedStrings.xml files. In addition, the Perl data extraction routines do not rely on XML techniques that require well-formed XML, which is a stumbling block for other Excel 2007 recovery programs.
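The same approach can be sketched for the spreadsheet case. The Python below is only an illustration under stated assumptions, not the tool's Perl code: the function names shared_strings and sheet_to_csv and the sample data are hypothetical, cells are written in the order they appear (empty columns are not padded), and the inputs are assumed to be the already unzipped sharedStrings.xml and worksheet XML as strings.

```python
import csv
import html
import io
import re


def shared_strings(xml: str) -> list[str]:
    """Collect the text of every complete <si> entry in sharedStrings.xml."""
    strings = []
    for si in re.findall(r"<si>(.*?)</si>", xml, flags=re.S):
        parts = re.findall(r"<t(?:\s[^>]*)?>(.*?)</t>", si, flags=re.S)
        strings.append(html.unescape("".join(parts)))
    return strings


def sheet_to_csv(sheet_xml: str, strings: list[str]) -> str:
    """Turn the complete cells of a (possibly truncated) worksheet into CSV."""
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    # Split on row starts so a final row without </row> is still examined.
    for chunk in re.split(r"<row\b", sheet_xml)[1:]:
        row = []
        for attrs, body in re.findall(r"<c\b([^>/]*)>(.*?)</c>", chunk, flags=re.S):
            match = re.search(r"<v>(.*?)</v>", body, flags=re.S)
            value = html.unescape(match.group(1)) if match else ""
            # t="s" marks a cell whose value is an index into sharedStrings.
            if 't="s"' in attrs and value.isdigit() and int(value) < len(strings):
                value = strings[int(value)]
            row.append(value)
        if row:
            writer.writerow(row)
    return out.getvalue()


# Hypothetical sample: the worksheet is cut off inside the last cell.
strings_xml = "<sst><si><t>Name</t></si><si><t>Ann &amp; Bob</t></si></sst>"
sheet_xml = (
    '<worksheet><sheetData>'
    '<row r="1"><c r="A1" t="s"><v>0</v></c><c r="B1"><v>42</v></c></row>'
    '<row r="2"><c r="A2" t="s"><v>1</v></c><c r="B2"><v>3.5</v></c><c r="C2"><v>9'
)
result = sheet_to_csv(sheet_xml, shared_strings(strings_xml))
print(result)
```

The incomplete last cell is skipped, but the two complete cells in the damaged row are still recovered, which is the kind of partial salvage a strict XML parser would not allow.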
http://www.godskingsandheroes.info/softwar...el2csv-0.50.zip
May 8 2009, 09:38 AM, updated 17y ago