Yahoo is introducint a new module called the XPATH fetch page and is obsoleting an older module called the Fetch Page.
The XPATH Fetch Page module will extract a Web page under 1.5MB of your choice and has several options including the fixing of malformed tags using Tidy. This article on the Yahoo! Pipes Blog goes into further depth explaining the details and provides several examples to try out.
See on pipes.yqlblog.net