[Solved] Error reading data from the internet

Discuss the spreadsheet application

[Solved] Error reading data from the internet

Postby go2visions » Thu Nov 03, 2016 2:29 pm

Hello,

Is it possible to automatically fetch (one time) an HTML page, scrape the page and input the data into Calc?

If we can, please direct me to the correct Help documentation section.
Last edited by go2visions on Fri Nov 04, 2016 7:36 pm, edited 3 times in total.
Open Office Version 4.1.2
Windows 10
go2visions
 
Posts: 37
Joined: Fri Oct 21, 2016 11:30 pm

Re: Scrape an HTML page

Postby Villeroy » Thu Nov 03, 2016 3:03 pm

menu:Insert>Link to external data...
Paste the URL and wait a few seconds until the table sections are analysed.
Select the section you need.

However, this is a very bad approach. The table data of your html come from some other data source and they are dumped into the html in order to be read by humans. You should access the other data source directly.
If you have no access to the original source because you harvest data from some other companies, there are far more efficient tools than an office suite.
Please, edit this topic's initial post and add "[Solved]" to the subject line if your problem has been solved.
Ubuntu 18.04, no OpenOffice, LibreOffice 6.4
User avatar
Villeroy
Volunteer
 
Posts: 28664
Joined: Mon Oct 08, 2007 1:35 am
Location: Germany

Re: Scrape an HTML page

Postby go2visions » Thu Nov 03, 2016 7:04 pm

Thank you for your reply.

This is the process I a currently use:

1. I go to the external website
2. I press [CTRL]+[A} to select the entire page
3. Next, [CTRL]+[C]
4. Go to an OpenOffice sheet and [CTRL]+[V}
5. This copies the external HTML sheet into OpenOffice
6. It then converts the HTML data to the data I can use

This is very time consuming as I have sometimes 20-30+ pages to convert a day.

As a test, the website I am trying to fetch through Calc is https://mylularoe.com/reports/sales-by-date
This link requires user id and password validation.
When I am logged in, it opens the pasted link in Calc to the Firefox HTML page without error.

The problem is when I go to menu:Insert>Link to external data, I receive the error message
Error reading data from the Internet.
Server error message:.


I have Java 1.8.0_111 installed and running

I would like to insert a column of hyperlinks and have Calc read each page externally, automatically.
Will this be possible? Or what do you suggest to be my best alternative approach?

Thank you in advance for your reply.
Open Office Version 4.1.2
Windows 10
go2visions
 
Posts: 37
Joined: Fri Oct 21, 2016 11:30 pm

Re: Error reading data from the internet

Postby jrkrideau » Fri Nov 04, 2016 4:34 pm

LibreOffice 6.4.3.2; Ubuntu 20.04
jrkrideau
Volunteer
 
Posts: 3788
Joined: Sun Dec 30, 2007 10:00 pm
Location: Kingston Ontario Canada

Re: [Solved] Error reading data from the internet

Postby Chainmailguy » Wed Feb 19, 2020 5:16 pm

There seems to be an issue with authentication. I get the same failure when visiting https:// and success when visiting http://
I have no solution.
OpenOffice 4.1.6 on PortableApps 16.1 on Win10
Chainmailguy
 
Posts: 1
Joined: Wed Feb 19, 2020 5:13 pm


Return to Calc

Who is online

Users browsing this forum: No registered users and 10 guests