[Solved] Error reading data from the internet

Discuss the spreadsheet application
go2visions
Posts: 37
Joined: Fri Oct 21, 2016 11:30 pm

[Solved] Error reading data from the internet

Post by go2visions »

Hello,

Is it possible to automatically fetch (one time) an HTML page, scrape the page and input the data into Calc?

If we can, please direct me to the correct Help documentation section.
Last edited by go2visions on Fri Nov 04, 2016 7:36 pm, edited 3 times in total.
Open Office Version 4.1.2
Windows 10
Villeroy
Volunteer
Posts: 31279
Joined: Mon Oct 08, 2007 1:35 am
Location: Germany

Re: Scrape an HTML page

Post by Villeroy »

menu:Insert>Link to external data...
Paste the URL and wait a few seconds until the table sections are analysed.
Select the section you need.

However, this is a very bad approach. The table data in your HTML comes from some other data source and is dumped into the HTML in order to be read by humans. You should access that other data source directly.
If you have no access to the original source because you harvest data from some other companies, there are far more efficient tools than an office suite.
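As one illustration of what a tool outside the office suite might look like, here is a minimal Python sketch that pulls the cells of an HTML table out of a page and writes them as CSV, which Calc can open directly. It uses only the standard library. The names `TableExtractor` and `html_tables_to_csv` are made up for this example, and the sketch assumes simple, non-nested tables; real pages may need more care.

```python
import csv
import io
from html.parser import HTMLParser


class TableExtractor(HTMLParser):
    """Collect the text of every <td>/<th> cell, grouped by row and table."""

    def __init__(self):
        super().__init__()
        self.tables = []   # one list of rows per <table> seen
        self._row = None   # cells of the row being read, or None
        self._cell = None  # text fragments of the cell being read, or None

    def _end_cell(self):
        # Store the finished cell with runs of whitespace collapsed.
        if self._cell is not None:
            self._row.append(" ".join("".join(self._cell).split()))
            self._cell = None

    def _end_row(self):
        self._end_cell()
        if self._row is not None:
            self.tables[-1].append(self._row)
            self._row = None

    def handle_starttag(self, tag, attrs):
        if tag == "table":
            self._end_row()
            self.tables.append([])
        elif tag == "tr" and self.tables:
            self._end_row()  # tolerate a missing </tr>
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._end_cell()  # tolerate a missing </td>
            self._cell = []

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th"):
            self._end_cell()
        elif tag in ("tr", "table"):
            self._end_row()


def html_tables_to_csv(html, table_index=0):
    """Return the chosen table of an HTML string as CSV text."""
    parser = TableExtractor()
    parser.feed(html)
    parser.close()
    out = io.StringIO()
    csv.writer(out, lineterminator="\n").writerows(parser.tables[table_index])
    return out.getvalue()
```

Saving the returned text to a `.csv` file gives something that can be opened in Calc, or linked via menu:Insert>Link to external data.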
Please, edit this topic's initial post and add "[Solved]" to the subject line if your problem has been solved.
Ubuntu 18.04 with LibreOffice 6.0, latest OpenOffice and LibreOffice
go2visions
Posts: 37
Joined: Fri Oct 21, 2016 11:30 pm

Re: Scrape an HTML page

Post by go2visions »

Thank you for your reply.

This is the process I currently use:

1. I go to the external website
2. I press [CTRL]+[A] to select the entire page
3. Next, [CTRL]+[C]
4. Go to an OpenOffice sheet and [CTRL]+[V]
5. This copies the external HTML sheet into OpenOffice
6. It then converts the HTML data to the data I can use

This is very time-consuming, as I sometimes have 20-30+ pages to convert a day.

As a test, the website I am trying to fetch through Calc is https://mylularoe.com/reports/sales-by-date
This link requires user id and password validation.
When I am logged in, clicking the pasted link in Calc opens the page in Firefox without error.

The problem is that when I go to menu:Insert>Link to external data, I receive this error message:
Error reading data from the Internet.
Server error message:.


I have Java 1.8.0_111 installed and running

I would like to insert a column of hyperlinks and have Calc read each page externally, automatically.
Will this be possible? Or what do you suggest to be my best alternative approach?
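For illustration only, the "column of hyperlinks" idea could be sketched outside Calc as a small Python loop over a list of URLs. The function names `make_opener` and `fetch_pages` are hypothetical. Note that a script does not share the Firefox login session, so for a password-protected site the login step (which is site-specific and not shown here) would have to go through the same cookie-keeping opener first.

```python
import urllib.request
from http.cookiejar import CookieJar


def make_opener():
    """Build an opener that keeps cookies between requests (e.g. a login session)."""
    return urllib.request.build_opener(
        urllib.request.HTTPCookieProcessor(CookieJar())
    )


def fetch_pages(urls, open_url):
    """Fetch each URL in turn; return ({url: text}, {url: error message})."""
    pages, errors = {}, {}
    for url in urls:
        try:
            with open_url(url) as response:
                pages[url] = response.read().decode("utf-8", errors="replace")
        except OSError as exc:  # urllib.error.URLError is a subclass of OSError
            errors[url] = str(exc)
    return pages, errors
```

A typical call would be `pages, errors = fetch_pages(list_of_urls, make_opener().open)`. Pages that fail end up in `errors` instead of stopping the whole run, which matters when 20-30 pages are fetched in one go.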

Thank you in advance for your reply.
Open Office Version 4.1.2
Windows 10
jrkrideau
Volunteer
Posts: 3816
Joined: Sun Dec 30, 2007 10:00 pm
Location: Kingston Ontario Canada

Re: Error reading data from the internet

Post by jrkrideau »

LibreOffice 7.3.7.2; Ubuntu 22.04
Chainmailguy
Posts: 1
Joined: Wed Feb 19, 2020 5:13 pm

Re: [Solved] Error reading data from the internet

Post by Chainmailguy »

There seems to be an issue with authentication. I get the same failure when visiting https:// and success when visiting http://
I have no solution.
OpenOffice 4.1.6 on PortableApps 16.1 on Win10