<?xml version="1.0" encoding="UTF-8"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en-gb">
	<link rel="self" type="application/atom+xml" href="https://forum.eggheads.org/app.php/feed/topic/12150" />

	<title>egghelp/eggheads community</title>
	<subtitle>Discussion of eggdrop bots, shell accounts and tcl scripts.</subtitle>
	<link href="https://forum.eggheads.org/index.php" />
	<updated>2006-07-19T08:25:47-04:00</updated>

	<author><name><![CDATA[egghelp/eggheads community]]></name></author>
	<id>https://forum.eggheads.org/app.php/feed/topic/12150</id>

		<entry>
		<author><name><![CDATA[Yourmove]]></name></author>
		<updated>2006-07-19T08:25:47-04:00</updated>

		<published>2006-07-19T08:25:47-04:00</published>
		<id>https://forum.eggheads.org/viewtopic.php?p=64710#p64710</id>
		<link href="https://forum.eggheads.org/viewtopic.php?p=64710#p64710"/>
		<title type="html"><![CDATA[Ok]]></title>

		
		<content type="html" xml:base="https://forum.eggheads.org/viewtopic.php?p=64710#p64710"><![CDATA[
Oh, I didn't even notice that part. Thanks for the information. I'll try and see what I can do. I'll report back if I still get problems.<br><br>Thanks again.<p>Statistics: Posted by <a href="https://forum.eggheads.org/memberlist.php?mode=viewprofile&amp;u=7981">Yourmove</a> — Wed Jul 19, 2006 8:25 am</p><hr />
]]></content>
	</entry>
		<entry>
		<author><name><![CDATA[De Kus]]></name></author>
		<updated>2006-07-19T03:35:28-04:00</updated>

		<published>2006-07-19T03:35:28-04:00</published>
		<id>https://forum.eggheads.org/viewtopic.php?p=64707#p64707</id>
		<link href="https://forum.eggheads.org/viewtopic.php?p=64707#p64707"/>
		<title type="html"><![CDATA[Parsing HTML encoded in US-Ascii]]></title>

		
		<content type="html" xml:base="https://forum.eggheads.org/viewtopic.php?p=64707#p64707"><![CDATA[
I somehow doubt its a charset problem (since the default charset iso-8859-1 and most others include US-ASCII). I rather believe its because the server returns a gzipped page. The server sends gzipped content even if you explicitly forbid it in the HTTP request or even a HTTP version which doesnt support that and is therefore a violation against HTTP <a href="ftp://ftp.rfc-editor.org/in-notes/rfc2965.txt" class="postlink">RFC 2965</a>/<a href="http://www.w3.org/Protocols/rfc2616/rfc2616-sec14.html#sec14.3" class="postlink">RFC 2616</a> in many ways<br>.you will most likely have to turn over the content to gunzip so you can read uncompressed file then.<blockquote class="uncited"><div>GET /perl-bin/animedb.pl HTTP/1.1<br>Host: anidb.info<br>Accept-Encoding: chunked;q=1, *;q=0<br><br>HTTP/1.1 200 OK<br>Date: Wed, 19 Jul 2006 07:30:27 GMT<br>Server: Apache/1.3.36 (Unix) mod_perl/1.29<br>Set-Cookie: adbuin=1153294273-nVfC; path=/; expires=Sat, 16-Jul-2016 07:31:13 GMT<br>Cache-control: no-cache<br>Pragma: no-cache<br>Content-Type: text/html<br>Expires: Wed, 19 Jul 2006 07:31:13 GMT<br>X-Cache: MISS from anidb.info<br>Content-Encoding: gzip<br>Content-Length: 8216</div></blockquote><blockquote class="uncited"><div>GET /perl-bin/animedb.pl HTTP/1.0<br>Host: anidb.info<br><br>HTTP/1.1 200 OK<br>Date: Wed, 19 Jul 2006 07:31:57 GMT<br>Server: Apache/1.3.36 (Unix) mod_perl/1.29<br>Set-Cookie: adbuin=1153294324-QXPa; path=/; expires=Sat, 16-Jul-2016 07:32:04 GMT<br>Cache-control: no-cache<br>Pragma: no-cache<br>Content-Type: text/html<br>Expires: Wed, 19 Jul 2006 07:32:04 GMT<br>X-Cache: MISS from anidb.info<br>Connection: close<br>Content-Encoding: gzip<br>Content-Length: 8216</div></blockquote>As you can see... it even ignores the HTTP/1.0 request and sends HTTP/1.1 even if its not supported. I wonder if you can make Apache doing that without hardcoding the header in the PERL scripts which would be just plainly stupid from side of the scripter... maybe they don't care about people not being able to use gzip (even old IE would choke on that, since it supported only deflate).<br><br>Hint: if you want to show the &amp;...; encoded Japanese charaters you will most likely have to use UTF-8 or SHIFT-JIS output (and of course find a libary that can convert them to a native encoding supported by TCL).<p>Statistics: Posted by <a href="https://forum.eggheads.org/memberlist.php?mode=viewprofile&amp;u=2382">De Kus</a> — Wed Jul 19, 2006 3:35 am</p><hr />
]]></content>
	</entry>
		<entry>
		<author><name><![CDATA[Yourmove]]></name></author>
		<updated>2006-07-18T16:19:13-04:00</updated>

		<published>2006-07-18T16:19:13-04:00</published>
		<id>https://forum.eggheads.org/viewtopic.php?p=64702#p64702</id>
		<link href="https://forum.eggheads.org/viewtopic.php?p=64702#p64702"/>
		<title type="html"><![CDATA[Parsing HTML encoded in US-Ascii]]></title>

		
		<content type="html" xml:base="https://forum.eggheads.org/viewtopic.php?p=64702#p64702"><![CDATA[
I've been trying (for a very long time) to parse a website (<a href="http://www.anidb.info" class="postlink">http://www.anidb.info</a>) using my eggdrop bot, however for some reason all it would return is jibberish. My other scripts that parsed websites worked fine. I wasn't sure what was happening at first but then I realized that my eggdrop didn't have a *.enc file for us-ascii. I tried to create my own however it seems that I couldn't change the encoding files directory. So I came here (after searching the forums for an answer) to ask if anyone has successfully been able to parse a website that was encoded in US-ASCII and what was the process that you used? I read the tutorials on characters and encoding but...that really didn't help me solve the problem. The system is using TCL 8.4 and I'm using the http package.<br><br>Edit: I'm still new to TCL so please be patient...<p>Statistics: Posted by <a href="https://forum.eggheads.org/memberlist.php?mode=viewprofile&amp;u=7981">Yourmove</a> — Tue Jul 18, 2006 4:19 pm</p><hr />
]]></content>
	</entry>
	</feed>
