[fpc-pascal] fpWeb and html and uriescaping/unescapingelements

ik idokan at gmail.com
Wed May 25 13:10:55 CEST 2011


On Wed, May 25, 2011 at 14:04, Ludo Brands <ludo.brands at free.fr> wrote:

>  > So you suggest to place the & translation last.
> That would be a solution for this problem. My previous comments on the
> "minimalistic" approach of the implementation suggest a different approach:
> scan the source string once and replace html entities as you find them (with
> a look up table for example). This would scale much better when implementing
> iso-8859-1 or '&#entity-number;' unescaping .
>

For every html entity you need a dictionary, and that's makes your program
with more fat.
converting number entity can be done, but you need to know what is the code
page you wish to convert from/to.


>
> Ludo
>

Ido


>   -----Message d'origine-----
> *De :* fpc-pascal-bounces at lists.freepascal.org [mailto:
> fpc-pascal-bounces at lists.freepascal.org] *De la part de* ik
> *Envoyé :* mercredi 25 mai 2011 09:22
>
> *À :* FPC-Pascal users discussions
> *Objet :* Re: RE : Re[2]: [fpc-pascal] fpWeb and html and
> uriescaping/unescapingelements
>
> On Wed, May 25, 2011 at 10:09, Ludo Brands <ludo.brands at free.fr> wrote:
>
>>  You should not unescape recursively.
>> Input to EscapeHTML: '<' Output: '&lt;' : Correct
>> UnescapeHTML: input  '&lt;' Output '<'   Wrong.
>> This is because you replace '&' with '&' resulting in '<' which is
>> translated to '< ' in the next line.
>>
>
> So you suggest to place the & translation last.
>
>
>>
>> Ludo
>>
>>
>>
>>  -----Message d'origine-----
>> *De :* fpc-pascal-bounces at lists.freepascal.org [mailto:
>> fpc-pascal-bounces at lists.freepascal.org] *De la part de* ik
>> *Envoyé :* mercredi 25 mai 2011 08:34
>> *À :* FPC-Pascal users discussions
>> *Objet :* Re: Re[2]: [fpc-pascal] fpWeb and html and uri
>> escaping/unescapingelements
>>
>>
>>
>>
>> On Tue, May 24, 2011 at 12:21, José Mejuto <joshyfun at gmail.com> wrote:
>>
>>> Hello FPC-Pascal,
>>>
>>> Tuesday, May 24, 2011, 10:09:03 AM, you wrote:
>>>
>>> i> I've created a patch with the Escape and unEscape functions, and place
>>> it
>>> i> here: http://bugs.freepascal.org/view.php?id=19407
>>>
>>> Un/escapeHTML parsing must be in one go, specially the "&amp" one. Test
>>> against:
>>>
>>> "&lt;"
>>>
>>
>> I'm not sure what you mean here.
>>
>> If you have already html entities you should not escape them. If you do
>> not have html entities you should escape them.
>> The Escaping and unescaping works well, I already tested them before I
>> sent them.
>>
>>
>>
>>>
>>> Right unescape "<", your code "<".
>>>
>>> --
>>> Best regards,
>>>  José
>>>
>>
>> Ido
>>
>>
>>>
>>> _______________________________________________
>>> fpc-pascal maillist  -  fpc-pascal at lists.freepascal.org
>>> http://lists.freepascal.org/mailman/listinfo/fpc-pascal
>>>
>>
>>
>> _______________________________________________
>> fpc-pascal maillist  -  fpc-pascal at lists.freepascal.org
>> http://lists.freepascal.org/mailman/listinfo/fpc-pascal
>>
>
>
> _______________________________________________
> fpc-pascal maillist  -  fpc-pascal at lists.freepascal.org
> http://lists.freepascal.org/mailman/listinfo/fpc-pascal
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.freepascal.org/pipermail/fpc-pascal/attachments/20110525/dfcf605f/attachment.html>


More information about the fpc-pascal mailing list