Ramiro Diaz Trepat
12/14/2007 10:58:00 PM
Hello list,
I have to develop a simple script to parse some parts of a web site and I
thought it could be a good opportunity to start trying Ruby.
I found two network libraries that I could apparently use to retrieve
the contents of the web site: open-uri and net/http.
*First problem*
This web site is accessible only over https and uses a self-signed
certificate. So far this has made it impossible for me to access the
contents of the web site.
Simple examples from the Hpricot HTML parsing library, like this one:
require 'hpricot'
require 'open-uri'
doc = Hpricot(open("https://xxxxxx"))
will not work, because the call to open fails on the https connection.
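For reference, one way around the failure above might be to use net/http directly and switch off certificate verification. This is only a rough sketch, assuming the failure really is caused by the self-signed certificate. The helper names insecure_https_client and fetch_body are made up for illustration, and skipping verification removes the protection https normally gives, so it is only reasonable for a site you already trust.

```ruby
require 'net/http'
require 'openssl'
require 'uri'

# Hypothetical helper: build a Net::HTTP client for the given URL that
# does NOT verify the server certificate (needed for self-signed certs).
# WARNING: VERIFY_NONE leaves the connection open to man-in-the-middle attacks.
def insecure_https_client(url)
  uri = URI.parse(url)
  http = Net::HTTP.new(uri.host, uri.port)
  http.use_ssl = (uri.scheme == 'https')
  http.verify_mode = OpenSSL::SSL::VERIFY_NONE
  http
end

# Hypothetical helper: fetch a page body as a string (performs a network request).
def fetch_body(url)
  uri = URI.parse(url)
  insecure_https_client(url).request_get(uri.request_uri).body
end

# The resulting string could then be handed to Hpricot instead of open(...):
#   doc = Hpricot(fetch_body("https://xxxxxx"))
```

A safer alternative, if the certificate file is available, would be to keep verification on and point the client at that certificate instead of disabling the check.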
*Second problem*
I also need to know how to handle redirects and cookies. But to be
fair, I can still do some further reading myself on these issues.
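As a starting point for the redirect and cookie question, here is a minimal sketch with net/http. It assumes that the site sets cookies through Set-Cookie headers and redirects with a Location header. The names cookie_header and fetch_following_redirects are invented for this example, certificate handling is left out, and newer cookies simply replace older ones rather than being merged.

```ruby
require 'net/http'
require 'openssl'
require 'uri'

# Hypothetical helper: turn raw Set-Cookie values into a Cookie request
# header, keeping only the leading name=value pair of each cookie.
def cookie_header(set_cookie_values)
  set_cookie_values.map { |c| c.split(';').first.strip }.join('; ')
end

# Hypothetical helper: GET a URL, following up to `limit` redirects and
# sending back whatever cookies the previous response set.
def fetch_following_redirects(url, cookies = nil, limit = 5)
  raise ArgumentError, 'too many redirects' if limit.zero?

  uri = URI.parse(url)
  http = Net::HTTP.new(uri.host, uri.port)
  http.use_ssl = (uri.scheme == 'https')

  request = Net::HTTP::Get.new(uri.request_uri)
  request['Cookie'] = cookies if cookies
  response = http.request(request)

  # Simplification: cookies from this response replace any earlier ones.
  new_cookies = response.get_fields('Set-Cookie')
  cookies = cookie_header(new_cookies) if new_cookies

  if response.is_a?(Net::HTTPRedirection)
    # Location may be relative, so resolve it against the current URL.
    target = URI.join(url, response['location']).to_s
    fetch_following_redirects(target, cookies, limit - 1)
  else
    response
  end
end
```

The redirect limit is there so that two pages redirecting to each other cannot loop forever.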
Thank you very much.