Where opportunity calls parallely !

 
Muhammed Mahbub Hossain
Muhammed Mahbub Hossain
Creating professional solutions for tough professionals

Ins and Outs of Web Scraping - Web Scraping in Perl, Python & Cold Fusion

(page 4 of 5)

=============================Perl =======================================

 

Mechanize library can be used for automating interaction with websites. Mechanize automatically stores and sends cookies, follows redirects, can follow links, and submit forms.

 Example:
 use WWW::Mechanize;
  use Storable;
    $url = 'http://www.linkedbd.com';
    $m = WWW::Mechanize->new();
    $m->get($url);

 

 

=============================Python =====================================

Method 1

urllib2 can be used for creating your own HTTP requests.urllib2  is standard python library.urllib2 module defines functions and classes which help in opening URLs (mostly HTTP) in a complex world basic and digest authentication, redirections, cookies and more.

 

Example:

import f print 
 

Method 2

Mechanize can be used for creating your own HTTP requests. Mechanize  is basically a python web browser. Any url can be opened, not only http. mechanize.UserAgentBase offers easy dynamic configuration of user-agent features like protocol, cookie, redirection.

Example:


import
f print
=============================Cold Fusion==================================

Method 1


cfhttp can be used for creating your own HTTP requests.

Example:

<cfhttp url="http://www.linkedbd.com/">

<cfoutput>

  #cfhttp.filecontent#

</cfoutput>

 

 

Method 2


CFX_HTTP5 Custom Tag for ColdFusion Application server meant to be used in ColdFusion scripts for asynchronous access to HTTP servers.

Example:

<CFX_HTTP URL=http://www.linkedbd.com METHOD=GET OUT=RESULT>

<CFIF STATUS EQ "ER">

    <!---  ERORR PROCESSING --->

    <h2>

    <font color="#aa0000">Server returned error:

    <CFOUTPUT>Error number: #ERRN#</CFOUTPUT></font>

    </h2>

    <CFOUTPUT>#MSG#</CFOUTPUT>

<CFELSE>

    <!--- SUCCESSFUL REQUEST --->

    <CFOUTPUT>#RESULT#</CFOUTPUT>

</CFIF>

 

Rate this article

Average rating :

 

Your rating :

Comments On : Ins and Outs of Web Scraping
M. K. Basher
One of the best short brief on web scraping, I have ever found. Thanks for sharing it.
Aug 02, 2012
Abu Hena...
You describe web scraping with different angles & perspective in a short brief. its really amusing. it clearly describes your depth of knowledge with programming language. we expect another important article from you soon.
Mar 21, 2011
Ajay  Sharma
Thanks For Sharing Such a useful information Thanks
Nov 28, 2009
Muhammed...
Thanks
Mar 25, 2009
MD.Elme...
This is the greatest article I ever found on web about web scraping. It covers vast amount of web scraping technique that are using now a days with almost every modern language.
Mar 25, 2009

Would you like to comment?

Join Paracalls for a free account, or sign in if you are already a member.
Company Sign Up Form
 
Loading ...