[lnkForumImage]
TotalShareware - Download Free Software

Confronta i prezzi di migliaia di prodotti.
Asp Forum
 Home | Login | Register | Search 


 

Carter Temm

10/25/2014 1:54:00 PM

Is there anyway to create a file converter? I am talking about convert Microsoft Word files to plane text, ETC. if this is possible, can I please have a code?
7 Answers

Evertjan.

10/25/2014 3:14:00 PM

0

Carter Temm <crtbraille@gmail.com> wrote on 25 okt 2014 in
comp.lang.javascript:

> Is there anyway to create a file converter? I am talking about convert
> Microsoft Word files to plane text, ETC.

If you mean using Javascript?

Yes.

> if this is possible, can I please have a code?

Why would you think,
that if something is possible,
that it would have been done?

And if it was done,
that the writer would divulge his source-code?

Methinks, you will have to write it yourself,
after serious study about the ins and outs of a .docx file.

Could you do that?

Methinks not, given your Q.

--
Evertjan.
The Netherlands.
(Please change the x'es to dots in my emailaddress)

Thomas 'PointedEars' Lahn

10/25/2014 5:35:00 PM

0

Carter Temm wrote:

> Is there anyway to create a file converter?

(_any way_)

Yes, there are several ways. What is your runtime environment?

> I am talking about convert Microsoft Word files to plane text,

(_plain_)

You need to specify â??Microsoft Word filesâ? as well. There is the binary,
fully proprietary format that is backwards-compatible to Microsoft Word for
Windows 6.0 (filename suffix .doc or DOC), and there is the somewhat
standardized format of Word XML (part of Office Open XML, ECMA-376), which
are basically zipped XML documents, that is used by Microsoft Word 2003 and
later (filename suffix .docx or .DOCX).

> ETC. if this is possible, can I please have a code?

Probably, yes.

However, are you sure that you have posted to the correct newsgroup?

--
PointedEars
FAQ: <http://PointedEars.... | SVN: <http://PointedEars.de...
Twitter: @PointedEars2 | ES Matrix: <http://PointedEars.de/es-...
Please do not Cc: me. / Bitte keine Kopien per E-Mail.

Joao Rodrigues

10/26/2014 1:44:00 PM

0

On 10/25/2014 03:34 PM, Thomas 'PointedEars' Lahn wrote:
> Carter Temm wrote:

<snip>

>> I am talking about convert Microsoft Word files to plane text,
>
> (_plain_)
>
> You need to specify â??Microsoft Word filesâ? as well. There is the binary,
> fully proprietary format that is backwards-compatible to Microsoft Word for
> Windows 6.0 (filename suffix .doc or DOC), and there is the somewhat
> standardized format of Word XML (part of Office Open XML, ECMA-376), which
> are basically zipped XML documents, that is used by Microsoft Word 2003 and
> later (filename suffix .docx or .DOCX).

..doc was the file format used in MS Word 97-2003, and .docx appeared as
of Word 2007. But one could open .docx files in earlier versions of Word
using the "Microsoft Compatibility Pack".

<http://en.wikipedia.org/wiki/List_of_Microsoft_Office_filename_exte...


--
Joao Rodrigues

Thomas 'PointedEars' Lahn

10/26/2014 2:20:00 PM

0

Joao Rodrigues wrote:

> .doc was the file format used in MS Word 97-2003, and .docx appeared as
> of Word 2007. [â?¦]

Simply wrong.

--
PointedEars
FAQ: <http://PointedEars.... | SVN: <http://PointedEars.de...
Twitter: @PointedEars2 | ES Matrix: <http://PointedEars.de/es-...
Please do not Cc: me. / Bitte keine Kopien per E-Mail.

JR

10/27/2014 5:23:00 PM

0

On 26/10/2014 12:20, Thomas 'PointedEars' Lahn wrote:
> Joao Rodrigues wrote:
>
>> .doc was the file format used in MS Word 97-2003, and .docx appeared as
>> of Word 2007. [â?¦]
>
> Simply wrong.
>

You don't know what you're talking about.

--
Joao Rodrigues

Thomas 'PointedEars' Lahn

10/27/2014 8:54:00 PM

0

Joao Rodrigues wrote:

> On 26/10/2014 12:20, Thomas 'PointedEars' Lahn wrote:
>> Joao Rodrigues wrote:
>>> .doc was the file format used in MS Word 97-2003, and .docx appeared as
>>> of Word 2007. [â?¦]
>> Simply wrong.
>
> You don't know what you're talking about.

It's the pot calling the kettle black again. Probably in contrast to you,
I have been there. I have opened and created DOCX documents with a default
setup of Microsoft Word 2003 back in 2006. I have also created DOC
documents with a default setup of Word for Windows 6.0 back in the 1990s.
(Actually, the DOC format dates back even further.)

So, *everything* in your statement is wrong:

1. .doc was _not_ introduced with MS Word 97; see also the date in [1].

2. .doc has _not_ ceased to be supported with Word 2003; it is still
supported (read/write) by the more recent versions of Microsoft Word,
for example Word 2010 where the â??File Typeâ? in the â??Save Asâ? dialog is
labeled in German â??Word 97-2003-Dokumentâ?? because the format was
not formally specified before those versions.

3. Office Open XML and its subset Word XML (.docx) were_not_ introduced with
Office 2007, but with Office 2003. It only was not the *default* before
version 2007 (probably because the Specification was not finished yet).
ECMA-376, that is based on it, was approved by the Ecma General Assembly
in December 2006.

_______
[1] <http://www.iana.org/assignments/media-types/application/...
--
PointedEars
FAQ: <http://PointedEars.... | SVN: <http://PointedEars.de...
Twitter: @PointedEars2 | ES Matrix: <http://PointedEars.de/es-...
Please do not Cc: me. / Bitte keine Kopien per E-Mail.

JR

10/28/2014 1:15:00 PM

0

On 27/10/2014 18:54, Thomas 'PointedEars' Lahn wrote:
> Joao Rodrigues wrote:
>
>> On 26/10/2014 12:20, Thomas 'PointedEars' Lahn wrote:
>>> Joao Rodrigues wrote:
>>>> .doc was the file format used in MS Word 97-2003, and .docx appeared as
>>>> of Word 2007. [â?¦]
>>> Simply wrong.
>>
>> You don't know what you're talking about.
>
> It's the pot calling the kettle black again. Probably in contrast to you,
> I have been there.

I've been using the Microsoft Office suite since 1995, and I've been
using MS Office 2003 until 2011, when I moved on to the 2007 version.
That's why I can say there was not a .docx file format in Word 2003,
unless one have installed the "Microsoft Office Compatibility Pack". In
addition, there is a Microsoft's article about it: "Open a Word 2007
document in an earlier version of Word"
<https://support.office.com/en-us/article/Open-a-Word-2007-document-in-an-earlier-version-of-Word-8fe47805-64a9-4cc5-a115-b148625fe043?ui=en-US&rs=en-US&...

> I have opened and created DOCX documents with a default
> setup of Microsoft Word 2003 back in 2006.

Certainly, you have been updating your Windows / Office installation
with the "Windows Update" and / or you have installed the "Microsoft
Office Compatibility Pack".

> I have also created DOC
> documents with a default setup of Word for Windows 6.0 back in the 1990s.
> (Actually, the DOC format dates back even further.)

ACK. I recall using .doc files since 1995.

>
> So, *everything* in your statement is wrong:

Not *everything*.

>
> 1. .doc was _not_ introduced with MS Word 97; see also the date in [1].

I have not written that the .doc file format was introduced with the MS
Word 97.

>
> 2. .doc has _not_ ceased to be supported with Word 2003; it is still
> supported (read/write) by the more recent versions of Microsoft Word,
> for example Word 2010 where the â??File Typeâ? in the â??Save Asâ? dialog is
> labeled in German â??Word 97-2003-Dokumentâ?? because the format was
> not formally specified before those versions.

I have not written that ".doc has ceased to be supported with Word 2003".

>
> 3. Office Open XML and its subset Word XML (.docx) were_not_ introduced with
> Office 2007, but with Office 2003. It only was not the *default* before
> version 2007 (probably because the Specification was not finished yet).
> ECMA-376, that is based on it, was approved by the Ecma General Assembly
> in December 2006.


Incorrect. MS Word 2003 supported XML, but the .docx binary file format
was introduced with the MS Word 2007. You are mistaking the two things.

--
Joao Rodrigues