Questions tagged [html]
HyperText Markup Language (HTML) is the main markup language for displaying web pages and other information that can be displayed in a web browser.
330 questions
0 votes
1 answer
111 views
Why does Firefox 'copy as cURL' not download anything here?
I tried to do this: curl "https://imslp.org/wiki/Goldberg-Variationen%2C_BWV_988_(Bach%2C_Johann_Sebastian)" | perl -nle 'print "$1" while /<span id="num-of-ratings-[0-9]{6}...
3 votes
2 answers
203 views
Embedded special characters skewing sed output
The Issue: I've been parsing a file with sed, trying to tease out the desired data. This has worked fine for most lines in the file, but there appear to be some embedded special characters that are ...
-2 votes
4 answers
226 views
How to strip data from html using awk?
I'd like to retrieve data from https://www.sbs.com.au/ondemand/tv-series/la-unidad/season-1. I wget the page to a file. The data I seek is in the form of (samples): https://www.sbs.com.au/ondemand/...
1 vote
2 answers
369 views
How can I select an element using the xmllint command?
I am trying to select "Bvlgari omnia crystalline'perfume' 100ml" using xmllint from the code below. But as I'm a newbie in the field of Linux, it is insanely difficult to figure out ...
0 votes
1 answer
93 views
Wget downloads wrong content
I'm trying to download a specific sitemap.xml (https://www.irna.ir/sitemap/all/sitemap.xml). The problem is that when you load the sitemap.xml, for a few seconds a white page with a header ...
2 votes
2 answers
441 views
CSS not updating on a `http.server` website
I have a website using the Python http.server module and it was working great. Earlier today I wanted two users to work on the same files (HTML, CSS, JS), so I set the permissions to 777 with chmod. The problem is ...
3 votes
4 answers
780 views
Convert pipe delimited column data to HTML table format for email
I am trying to convert pipe-delimited data to an HTML table for email output, and I am unsure how to use the pipe delimiter as a separator for HTML tabular formatting. Below is what I could ...
0 votes
2 answers
130 views
BSD sed/awk moving portion of line to line above (switching attribute in HTML file)
My situation is simple: I have an HTML file with several lines containing only the indented <section> block tag, each line followed by an (also indented) <h3 id="YYYY">...</...
0 votes
1 answer
61 views
Use wget to retrieve Supplemental Data from Science dot org
I'm building a pipeline in Snakemake to analyse some data. One of the data files I'm using is provided as supplemental data as part of this publication. The paper is behind a paywall, but I've ...
1 vote
3 answers
176 views
sed: To match a newline and spaces
I have the following file: <head> <title>this is a title</title> <style> here goes a style sheet </style> </head> I need to strip the <title> element ...
0 votes
1 answer
893 views
curl webpage and convert to markdown
I'm having a dilemma with downloading webpages and converting them to Markdown, for example: F=$(curl -O --silent https://www.guru3d.com/story/msi-teases-spatium-m560-ssd-with-innovative-nonmetallic-vc-...
1 vote
1 answer
103 views
How can I include any content in the sed replace command? [duplicate]
I want to be able to handle any type of content stored in the bash variable ${CONTENT}, to be used as sed replacement text within other content, no matter if there are quotation marks, single quotes ...
0 votes
1 answer
1k views
Is there a tool that preserves CSS formatting during HTML to PDF conversion?
I tried the options in "Is there a script or tool that converts HTML to PDF?" with the command: pandoc documentation.html -o test.pdf --pdf-engine=xelatex but unfortunately they do not preserve the CSS ...
0 votes
3 answers
122 views
Edit inside an HTML tag with ed(1)
Consider my humble hello.html file, edited with mighty ed: $ ed hello.html 28 ,p <title>Hello world!</title> What's your general approach to editing inside that title HTML tag (bonus if you ...
0 votes
2 answers
1k views
How can I fix a .tar.gz being downloaded as html?
I'm trying to download this file https://www.apache.org/dyn/closer.cgi/hadoop/common/hadoop-3.3.6/hadoop-3.3.6.tar.gz with wget. When I try to unpack it with tar I get the following error: gzip: stdin:...