[elinks-users] unicode conversion
Rick R
rick.richardson at gmail.com
Mon Jan 28 09:49:36 MST 2008
If I tack this onto the top of the HTML file I'm decoding, it works:
<meta http-equiv="content-type" content="text/html; charset=UTF-8" />
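That workaround could be scripted for a batch of files. A minimal sketch in Python, assuming each file fits in memory; the function name prepend_charset_meta is made up for illustration, and the "already declared" check is a crude substring match rather than a real HTML parse:

```python
# The meta element from the message above, as bytes so the file is never re-encoded.
META = b'<meta http-equiv="content-type" content="text/html; charset=UTF-8" />\n'


def prepend_charset_meta(html: bytes) -> bytes:
    """Return html with the UTF-8 meta element tacked onto the top.

    Hypothetical helper: the input is returned unchanged if it already
    appears to declare a charset somewhere.
    """
    if b"charset=" in html.lower():
        return html
    return META + html
```

To use it, read the file in binary mode, pass the bytes through the function, and write the result back out before handing the file to elinks.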
However, since everything I'm going to be viewing is likely in UTF-8, I
figured I could just set the document codepage via a local elinks.conf. I
can't get this to work.
The ./elinks.conf:
set document.codepage.assume = "utf8"
set document.codepage.force_assumed = 1
I invoke this as such:
./elinks -config-file ./elinks.conf -dump -dump-charset utf8 \
    -dump-color-mode 0 ./test_fr.html > out/test_fr.txt
Am I doing this incorrectly?
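Before forcing a codepage, it may be worth confirming that the input really is UTF-8. Single bytes E8, E9 and E0 are è, é and à in ISO-8859-1, and none of them is valid UTF-8 on its own. A small sketch, where the function name looks_like_utf8 is hypothetical and a strict decode is assumed to be a good enough test:

```python
def looks_like_utf8(data: bytes) -> bool:
    """Return True if data decodes cleanly as UTF-8.

    Note that pure ASCII also passes, since ASCII is a subset of UTF-8.
    """
    try:
        data.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False
```

If this returns False for the bytes read from ./test_fr.html, the file is probably in a legacy single-byte encoding, and assuming UTF-8 would not be expected to help.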
On Jan 25, 2008 5:40 PM, Kalle Olavi Niemitalo <kon at iki.fi> wrote:
> "Rick R" <rick.richardson at gmail.com> writes:
>
> > When converting French, the é, è and à are being converted to *.
> > (hex codes E9, E8 and E0)
> >
> > Is there something I can do to fix this?
>
> This might happen if ELinks misguesses the codepage of the
> document. The default value of the document.codepage.assume
> option is "System". You could try changing that or putting
> some meta http-equiv="content-type" elements in your documents.
>
> If that was not the cause, please describe step by step how
> to reproduce the bug, including any environment variables
> and elinks.conf contents.
>
> _______________________________________________
> elinks-users mailing list
> elinks-users at linuxfromscratch.org
> http://linuxfromscratch.org/mailman/listinfo/elinks-users
>
>
--
"Myths and legends die hard in America. We love them for the extra
dimension they provide, the illusion of near-infinite possibility to
erase the narrow confines of most men's reality. Weird heroes and
mould-breaking champions exist as living proof to those who need it
that the tyranny of 'the rat race' is not yet final." -- Hunter S.
Thompson