Hello,
Is it possible to automatically fetch (one time) an HTML page, scrape the page and input the data into Calc?
If we can, please direct me to the correct Help documentation section.
[Solved] Error reading data from the internet
-
- Posts: 37
- Joined: Fri Oct 21, 2016 11:30 pm
[Solved] Error reading data from the internet
Last edited by go2visions on Fri Nov 04, 2016 7:36 pm, edited 3 times in total.
Open Office Version 4.1.2
Windows 10
Windows 10
Re: Scrape an HTML page
menu:Insert>Link to external data...
Paste the URL and wait a few seconds until the table sections are analysed.
Select the section you need.
However, this is a very bad approach. The table data of your html come from some other data source and they are dumped into the html in order to be read by humans. You should access the other data source directly.
If you have no access to the original source because you harvest data from some other companies, there are far more efficient tools than an office suite.
Paste the URL and wait a few seconds until the table sections are analysed.
Select the section you need.
However, this is a very bad approach. The table data of your html come from some other data source and they are dumped into the html in order to be read by humans. You should access the other data source directly.
If you have no access to the original source because you harvest data from some other companies, there are far more efficient tools than an office suite.
Please, edit this topic's initial post and add "[Solved]" to the subject line if your problem has been solved.
Ubuntu 18.04 with LibreOffice 6.0, latest OpenOffice and LibreOffice
Ubuntu 18.04 with LibreOffice 6.0, latest OpenOffice and LibreOffice
-
- Posts: 37
- Joined: Fri Oct 21, 2016 11:30 pm
Re: Scrape an HTML page
Thank you for your reply.
This is the process I a currently use:
1. I go to the external website
2. I press [CTRL]+[A} to select the entire page
3. Next, [CTRL]+[C]
4. Go to an OpenOffice sheet and [CTRL]+[V}
5. This copies the external HTML sheet into OpenOffice
6. It then converts the HTML data to the data I can use
This is very time consuming as I have sometimes 20-30+ pages to convert a day.
As a test, the website I am trying to fetch through Calc is https://mylularoe.com/reports/sales-by-date
This link requires user id and password validation.
When I am logged in, it opens the pasted link in Calc to the Firefox HTML page without error.
The problem is when I go to menu:Insert>Link to external data, I receive the error message
Error reading data from the Internet.
Server error message:.
I have Java 1.8.0_111 installed and running
I would like to insert a column of hyperlinks and have Calc read each page externally, automatically.
Will this be possible? Or what do you suggest to be my best alternative approach?
Thank you in advance for your reply.
This is the process I a currently use:
1. I go to the external website
2. I press [CTRL]+[A} to select the entire page
3. Next, [CTRL]+[C]
4. Go to an OpenOffice sheet and [CTRL]+[V}
5. This copies the external HTML sheet into OpenOffice
6. It then converts the HTML data to the data I can use
This is very time consuming as I have sometimes 20-30+ pages to convert a day.
As a test, the website I am trying to fetch through Calc is https://mylularoe.com/reports/sales-by-date
This link requires user id and password validation.
When I am logged in, it opens the pasted link in Calc to the Firefox HTML page without error.
The problem is when I go to menu:Insert>Link to external data, I receive the error message
Error reading data from the Internet.
Server error message:.
I have Java 1.8.0_111 installed and running
I would like to insert a column of hyperlinks and have Calc read each page externally, automatically.
Will this be possible? Or what do you suggest to be my best alternative approach?
Thank you in advance for your reply.
Open Office Version 4.1.2
Windows 10
Windows 10
Re: Error reading data from the internet
Have a look at http://scraping.pro/software-for-web-scraping/ .
LibreOffice 7.3.7. 2; Ubuntu 22.04
-
- Posts: 1
- Joined: Wed Feb 19, 2020 5:13 pm
Re: [Solved] Error reading data from the internet
There seems to be an issue with authentication. I get the same failure when visiting https:// and success when visiting http://
I have no solution.
I have no solution.
OpenOffice 4.1.6 on PortableApps 16.1 on Win10