mirror of
https://github.com/xroche/httrack.git
synced 2026-06-21 01:28:35 +03:00
Replace the network dependency for crawl tests with a self-contained Python stdlib server (http.server + ssl) that httrack crawls over loopback. The server binds an ephemeral port and prints it on stdout; local-crawl.sh discovers the port, substitutes the BASEURL token into the httrack arguments, runs the crawl, and audits the mirror under the discovered host-root directory. This prototype migrates two cases off ut.httrack.com: - 13_local-cookies.test drives the cookie chain (entrance/second/third) reimplemented as Python handlers from the old ut/cookies/*.php fixtures. A missing or wrong cookie answers 500, so a clean 3-files/0-errors run proves the cookie jar is replayed across links. - 14_local-https.test crawls over HTTPS using a shipped long-dated self-signed cert. httrack does not verify certs, so the cert is accepted as-is and the real TLS path runs offline. The group skips (exit 77) when python3 is missing, mirroring check-network.sh. Fixtures and the cert are listed explicitly in EXTRA_DIST (automake does not expand globs); make distcheck passes. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com> Signed-off-by: Xavier Roche <roche@httrack.com>
19 lines
487 B
HTML
19 lines
487 B
HTML
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN"
|
|
"http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">
|
|
|
|
<html xmlns="http://www.w3.org/1999/xhtml" lang="fr">
|
|
|
|
<head>
|
|
<meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
|
|
<title>Sample test</title>
|
|
</head>
|
|
|
|
<body>
|
|
|
|
This is a <a href="link.html?v=1">link</a>
|
|
This is a <a href='link.html?v=2'>link</a>
|
|
This is a <a href="./link.html?v=3">link</a>
|
|
This is a <a href=link.html?v=4>link</a>
|
|
|
|
</body>
|