comp.lang.ruby

How to get contents of word file page by page

Talib Hussain

12/12/2008 10:04:00 AM

Hi,

I have a three-page document and I want to read the contents of each page. How can
I do that?

TIA,
Talib Hussain
--
Posted via http://www.ruby-....

12 Answers

Talib Hussain

12/12/2008 11:46:00 AM

Talib Hussain wrote:
> Hi,
>
> I have a three-page document and I want to read the contents of each page. How can
> I do that?
>
> TIA,
> Talib Hussain

Anyone, please?
--
Posted via http://www.ruby-....

David Mullet

12/12/2008 1:43:00 PM

Talib Hussain wrote:
> Hi,
>
> I have a three-page document and I want to read the contents of each page. How can
> I do that?
>
> TIA,
> Talib Hussain

Assuming...

-- You are working with a Microsoft Word document.
-- You have actual page breaks between pages

...you can create an array of the text on each page by getting the
document contents' text and splitting it on the page break. So, where
doc is your Word document object, you can do this:

pages = doc.content.text.split("\f")
pages.each do |page|
  # do something with this page's text
end

Hope that helps.

David

http://rubyonwindows.bl...
http://rubyonwindows.bl.../search/label/word
--
Posted via http://www.ruby-....
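
The splitting step David describes can be tried without Word at all. The following is a minimal sketch, assuming the document text has already been read into a string and that pages are separated by manual page breaks, which appear in the text as form-feed characters ("\f"). The helper name `pages_from_text` and the sample string are illustrative, not part of the original post. Automatic page breaks that Word inserts when a page fills up do not appear in the text, so this approach will not see them.

```ruby
# Split a document's text into pages on the form-feed character ("\f"),
# which is how a manual page break shows up in the document text.
# Hypothetical helper name; the sample text below is made up.
def pages_from_text(text)
  text.split("\f")
end

sample = "Page one text\r\fPage two text\r\fPage three text\r"

pages_from_text(sample).each_with_index do |page, index|
  # strip removes the trailing paragraph mark ("\r") for display
  puts "PAGE #{index + 1}: #{page.strip}"
end
```

With a real document, the same helper could be fed `doc.content.text` from the snippet above.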

Heesob Park

12/12/2008 1:45:00 PM

2008/12/12 Talib Hussain <talibhn@gmail.com>:
> Hi,
>
> I have a three-page document and I want to read the contents of each page. How can
> I do that?
>
If you only want the text contents, try this:

require 'win32ole'

word = WIN32OLE.new('word.application')
file = 'c:/work/test.doc'
doc = word.documents.open(file, 'ReadOnly' => true)
page_count = doc.ComputeStatistics(2)    # wdStatisticPages = 2
(1..page_count).each do |i|
  word.selection.goto(1, 1, i)           # wdGoToPage = 1, wdGoToAbsolute = 1
  word.selection.goto(-1, 0, 0, '\page') # wdGoToBookmark = -1; \page selects the current page
  puts "PAGE #{i}"
  puts word.selection.text
end
doc.close(false)                         # close without saving changes
word.quit

Regards,
Park Heesob

Talib Hussain

12/15/2008 4:46:00 AM

Heesob Park wrote:
> 2008/12/12 Talib Hussain <talibhn@gmail.com>:
>> Hi,
>>
> Regards,
> Park Heesob

Thanks a lot, Park, you are a genius.

My requirement is this: I have a Word document of, say, 3 pages
with formatted text.

I need to extract the contents of each page, with formatting, and save
each page as a separate PDF document.

Is this possible? If yes, how can I do that?

Also, do I need to install Office 2007 in order to save files as PDF
documents?

Kindly let me know.

--
Posted via http://www.ruby-....
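
One possible way to do this from Ruby is sketched below. It relies on Word's `ExportAsFixedFormat` method, which can export a page range to PDF. As far as I know, that method is available from Word 2007 onward (Word 2007 needs the "Save as PDF or XPS" add-in or a later service pack) and is not present in earlier versions. The helper name `export_pages_as_pdf`, the output directory, and the `pageN.pdf` naming are assumptions made for illustration. The method takes any object that responds to the two Word calls, so the logic can be checked without Word installed.

```ruby
# Word enumeration values used below.
WD_STATISTIC_PAGES       = 2   # wdStatisticPages
WD_EXPORT_FORMAT_PDF     = 17  # wdExportFormatPDF
WD_EXPORT_OPTIMIZE_PRINT = 0   # wdExportOptimizeForPrint
WD_EXPORT_FROM_TO        = 3   # wdExportFromTo

# Export every page of an open Word document to its own PDF file.
# doc:     a Word Document object (WIN32OLE), or anything with the same two methods
# out_dir: target directory (hypothetical; the file naming is illustrative too)
# Returns the list of PDF paths that were requested.
def export_pages_as_pdf(doc, out_dir)
  page_count = doc.ComputeStatistics(WD_STATISTIC_PAGES)
  (1..page_count).map do |page|
    # Word expects Windows-style paths, so convert forward slashes.
    path = File.join(out_dir, "page#{page}.pdf").tr('/', '\\')
    # Arguments: file name, format, open after export, optimize for,
    # range type, first page, last page.
    doc.ExportAsFixedFormat(path, WD_EXPORT_FORMAT_PDF, false,
                            WD_EXPORT_OPTIMIZE_PRINT, WD_EXPORT_FROM_TO,
                            page, page)
    path
  end
end

# Usage on Windows with Word installed (not executed here):
#   require 'win32ole'
#   word = WIN32OLE.new('word.application')
#   doc  = word.documents.open('c:/work/test.doc', 'ReadOnly' => true)
#   export_pages_as_pdf(doc, 'c:/work/out')
#   doc.close(false)
#   word.quit
```

Because Word renders each page itself, the formatting should carry over to the PDFs, and no intermediate per-page .doc files are needed.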

Firstname Secondname

12/15/2008 6:20:00 AM

You seem to be trying to solve a problem (converting a Word document to
PDF) with the wrong tool :). You don't need Ruby to convert a Word file to
PDF. There are tools like Word2pdf for this.

--
Posted via http://www.ruby-....

Talib Hussain

12/15/2008 6:43:00 AM

Name Surname wrote:
> You seem to be trying to solve a problem (converting a Word document to
> PDF) with the wrong tool :). You don't need Ruby to convert a Word file to
> PDF. There are tools like Word2pdf for this.

Agreed, but I have to create 3 separate .doc files out of one document
(one for each page of the document) and send these files as input to the PDF
converter.
--
Posted via http://www.ruby-....

Firstname Secondname

12/15/2008 6:56:00 AM

If you have a Word2pdf-like program, check whether you can specify which
page to convert. You could call Word2pdf several times, specifying
a different page number each time.

Word2pdf -n 1 infile.doc out1.pdf
Word2pdf -n 2 infile.doc out2.pdf
Word2pdf -n 3 infile.doc out3.pdf

:D
The only catch is finding (or already having) a Word2pdf program that supports
that :).


--
Posted via http://www.ruby-....

Saji N. Hameed

12/15/2008 7:59:00 AM

* Name Surname <mzilenas@gmail.com> [2008-12-15 15:55:34 +0900]:

> If you have a Word2pdf-like program, check whether you can specify which
> page to convert. You could call Word2pdf several times, specifying
> a different page number each time.
>
> Word2pdf -n 1 infile.doc out1.pdf
> Word2pdf -n 2 infile.doc out2.pdf
> Word2pdf -n 3 infile.doc out3.pdf
>
> :D
> The only catch is finding (or already having) a Word2pdf program that supports
> that :).
>

Surely OpenOffice must have something. It can export Word documents
as PDFs, so there may be a corresponding command-line utility...


saji

--
Saji N. Hameed

APEC Climate Center +82 51 668 7470
National Pension Corporation Busan Building 12F
Yeonsan 2-dong, Yeonje-gu, BUSAN 611705 saji@apcc21.net
KOREA
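
For what it is worth, current LibreOffice (the OpenOffice successor) ships a `soffice` command with `--headless`, `--convert-to`, and `--outdir` options. The sketch below only builds such a command from Ruby and prints it; the helper name `soffice_pdf_command` and the file names are illustrative, and it assumes `soffice` is on the PATH if you choose to run it. It converts the whole document, so it does not by itself solve the one-PDF-per-page requirement.

```ruby
# Build (but do not run) a headless LibreOffice command that converts
# a document to PDF. Hypothetical helper name; paths are examples.
def soffice_pdf_command(input, out_dir)
  ['soffice', '--headless', '--convert-to', 'pdf', '--outdir', out_dir, input]
end

cmd = soffice_pdf_command('infile.doc', 'out')
puts cmd.join(' ')

# To actually run the conversion (requires LibreOffice to be installed):
#   system(*cmd)
```

Passing the command to `system` as separate arguments avoids shell quoting problems with file names that contain spaces.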



Anandh Kumar

6/12/2009 2:54:00 PM

Thanks, Park, that was good. Now, say my Word document contains
student details such as name, marks, and register number. Those
are the entries I'll have. Can you tell me how to parse these strings and
upload them to the database?

Thanks
--
Posted via http://www.ruby-....
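
The parsing half of this question depends entirely on how the details are laid out in the document, which the post does not say. The following is a rough sketch under an assumed layout of one student per paragraph, written as "Name: ...; Marks: ...; Register No: ...". The pattern, the helper name `parse_students`, and the sample data are all hypothetical. Text read from Word separates paragraphs with "\r", so the sketch splits on that as well as on newlines. Writing the resulting hashes to a database is left out because it depends on the database and library in use.

```ruby
# Assumed (hypothetical) layout: "Name: <name>; Marks: <number>; Register No: <id>"
STUDENT_LINE = /Name:\s*(?<name>[^;]+);\s*Marks:\s*(?<marks>\d+);\s*Register No:\s*(?<register_no>\S+)/

# Turn document text into an array of hashes, one per matching paragraph.
# Paragraphs that do not match the assumed layout are skipped.
def parse_students(text)
  text.split(/\r\n?|\n/).map do |line|
    match = STUDENT_LINE.match(line)
    next unless match
    {
      name: match[:name].strip,
      marks: match[:marks].to_i,
      register_no: match[:register_no]
    }
  end.compact
end

sample = "Student list\rName: A. Example; Marks: 87; Register No: R1024\r" \
         "Name: B. Sample; Marks: 92; Register No: R1025\r"

parse_students(sample).each do |student|
  puts "#{student[:register_no]}: #{student[:name]} (#{student[:marks]})"
end
```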
