HTTP::Proxy::BodyFilter::htmlparser man page on Fedora

Man page or keyword search:  
man Server   31170 pages
apropos Keyword Search (all sections)
Output format
Fedora logo
[printable version]

HTTP::Proxy::BodyFilteUsertContributed PHTTP::Proxy::BodyFilter::htmlparser(3)

NAME
       HTTP::Proxy::BodyFilter::htmlparser - Filter using HTML::Parser

SYNOPSIS
	   use HTTP::Proxy::BodyFilter::htmlparser;

	   # $parser is a HTML::Parser object
	   $proxy->push_filter(
	       mime	=> 'text/html',
	       response => HTTP::Proxy::BodyFilter::htmlparser->new( $parser );
	   );

DESCRIPTION
       The HTTP::Proxy::BodyFilter::htmlparser lets you create a filter based
       on the HTML::Parser object of your choice.

       This filter takes a HTML::Parser object as an argument to its
       constructor.  The filter is either read-only or read-write. A read-only
       filter will not allow you to change the data on the fly. If you request
       a read-write filter, you'll have to rewrite the response-body
       completely.

       With a read-write filter, you must recreate the whole body data. This
       is mainly due to the fact that the HTML::Parser has its own buffering
       system, and that there is no easy way to correlate the data that
       triggered the HTML::Parser event and its original position in the chunk
       sent by the origin server. See below for details.

       Note that a simple filter that modify the HTML text (not the tags) can
       be created more easily with HTTP::Proxy::BodyFilter::htmltext.

   Creating a HTML::Parser that rewrites pages
       A read-write filter is declared by passing "rw => 1" to the
       constructor:

	    HTTP::Proxy::BodyFilter::htmlparser->new( $parser, rw => 1 );

       To be able to modify the body of a message, a filter created with
       HTTP::Proxy::BodyFilter::htmlparser must rewrite it completely. The
       HTML::Parser object can update a special attribute named "output".  To
       do so, the HTML::Parser handler will have to request the "self"
       attribute (that is to say, require access to the parser itself) and
       update its "output" key.

       The following attributes are added to the HTML::Parser object by this
       filter:

       output
	   A string that will hold the data sent back by the proxy.

	   This string will be used as a replacement for the body data only if
	   the filter is read-write, that is to say, if it was initialised
	   with "rw => 1".

	   Data should always be appended to "$parser->{output}".

       message
	   A reference to the HTTP::Message that triggered the filter.

       protocol
	   A reference to the HTTP::Protocol object.

METHODS
       This filter defines three methods, called automatically:

       filter()
	   The "filter()" method handles all the interactions with the
	   HTML::Parser object.

       init()
	   Initialise the filter with the HTML::Parser object passed to the
	   constructor.

       will_modify()
	   This method returns a boolean value that indicates to the system if
	   it will modify the data passing through. The value is actually the
	   value of the "rw" parameter passed to the constructor.

SEE ALSO
       HTTP::Proxy, HTTP::Proxy::Bodyfilter,
       HTTP::Proxy::BodyFilter::htmltext.

AUTHOR
       Philippe "BooK" Bruhat, <book@cpan.org>.

COPYRIGHT
       Copyright 2003-2006, Philippe Bruhat.

LICENSE
       This module is free software; you can redistribute it or modify it
       under the same terms as Perl itself.

perl v5.14.1			  2011-0HTTP::Proxy::BodyFilter::htmlparser(3)
[top]

List of man pages available for Fedora

Copyright (c) for man pages and the logo by the respective OS vendor.

For those who want to learn more, the polarhome community provides shell access and support.

[legal] [privacy] [GNU] [policy] [cookies] [netiquette] [sponsors] [FAQ]
Tweet
Polarhome, production since 1999.
Member of Polarhome portal.
Based on Fawad Halim's script.
....................................................................
Vote for polarhome
Free Shell Accounts :: the biggest list on the net