The tool would have a basic UI for a user to input the following:
-The pages to scrape. These could come from one of several sources:
---A manually entered list (such as URLs copied and pasted into a form).
---A list of URLs retrieved from a source URL. For example, the user would enter URL X, and the tool would download all links from page X and use that list of links as the pages to scrape for content.
-----To prevent scraping every URL, users would be able to provide regex rules for including/excluding URLs. For example, a rule might include only URLs that contain the /news/ directory.
-The content to scrape/collect, specified by CSS selectors or XPath rules (see the sketch below).
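For illustration, here is a minimal sketch of the link-collection, regex-filtering, and extraction steps in Python, assuming the requests and BeautifulSoup libraries. The URL, regex rules, and selector are placeholder examples, not final values, and note that BeautifulSoup handles CSS selectors only; XPath support would need a library such as lxml instead:

```python
import re
import requests
from bs4 import BeautifulSoup
from urllib.parse import urljoin

# Placeholder inputs -- in the real tool these would come from the UI.
SOURCE_URL = "https://example.com/"        # hypothetical source page
INCLUDE_RULES = [r"/news/"]                # keep URLs matching any of these
EXCLUDE_RULES = [r"\.pdf$"]                # drop URLs matching any of these
CONTENT_SELECTOR = "h1.article-title"      # hypothetical CSS selector

def collect_links(source_url):
    """Download the source page and return every absolute link on it."""
    html = requests.get(source_url, timeout=30).text
    soup = BeautifulSoup(html, "html.parser")
    return [urljoin(source_url, a["href"]) for a in soup.find_all("a", href=True)]

def filter_links(links):
    """Apply the include/exclude regex rules to the collected links."""
    kept = []
    for url in links:
        if INCLUDE_RULES and not any(re.search(p, url) for p in INCLUDE_RULES):
            continue
        if any(re.search(p, url) for p in EXCLUDE_RULES):
            continue
        kept.append(url)
    return kept

def scrape_page(url):
    """Fetch one page and extract the content matched by the CSS selector."""
    html = requests.get(url, timeout=30).text
    soup = BeautifulSoup(html, "html.parser")
    return [el.get_text(strip=True) for el in soup.select(CONTENT_SELECTOR)]

if __name__ == "__main__":
    for page in filter_links(collect_links(SOURCE_URL)):
        print(page, scrape_page(page))
```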
-The scraper would collect the data and store it in a Google Sheet (preferred) or a database (if necessary).
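One plausible way to write rows to a Google Sheet is the gspread library with a service-account credential. The credential file name, sheet name, and row layout below are assumptions for illustration only:

```python
import gspread

# Hypothetical credential file and sheet name -- replace with real ones.
gc = gspread.service_account(filename="service_account.json")
worksheet = gc.open("Scraper Output").sheet1

def store_rows(rows):
    """Append scraped rows (lists of cell values) to the sheet in one call."""
    worksheet.append_rows(rows)

store_rows([["https://example.com/news/1", "Example headline", "2024-01-01"]])
```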
-Basic cleanup functions would be available to clean the data: one to extract URLs from a block of text, and one to remove all HTML and special characters from the text.
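A rough sketch of the two cleanup functions, using only the Python standard library; the URL regex is a simple approximation, not an exhaustive matcher:

```python
import html
import re

URL_PATTERN = re.compile(r"https?://[^\s\"'<>]+")

def extract_urls(text):
    """Cleanup one: pull every URL out of a block of text."""
    return URL_PATTERN.findall(text)

def strip_html(text):
    """Cleanup two: remove HTML tags, decode entities, and drop
    special characters, leaving plain readable text."""
    no_tags = re.sub(r"<[^>]+>", " ", text)          # remove tags
    decoded = html.unescape(no_tags)                 # &amp; -> & etc.
    plain = re.sub(r"[^\w\s.,:;!?-]", "", decoded)   # drop special chars
    return re.sub(r"\s+", " ", plain).strip()        # collapse whitespace

print(extract_urls('See <a href="https://example.com/news/1">this</a>'))
print(strip_html("<p>Breaking &amp; exclusive!</p>"))
```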
-A script would then run to create an RSS feed based on the data. The feed would feature the 10 most recent items and be available at a public URL.
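A minimal sketch of the feed step using only the standard library, assuming the stored data can be read back as (title, link, date) tuples; the feed title, link, and description are placeholder text:

```python
import xml.etree.ElementTree as ET
from email.utils import format_datetime
from datetime import datetime, timezone

def build_rss(items, path="feed.xml"):
    """Write an RSS 2.0 file containing the 10 most recent items.

    `items` is assumed to be (title, link, published_datetime) tuples.
    """
    rss = ET.Element("rss", version="2.0")
    channel = ET.SubElement(rss, "channel")
    ET.SubElement(channel, "title").text = "Scraped News"        # placeholder
    ET.SubElement(channel, "link").text = "https://example.com"  # placeholder
    ET.SubElement(channel, "description").text = "Latest scraped items"

    newest_first = sorted(items, key=lambda it: it[2], reverse=True)
    for title, link, published in newest_first[:10]:
        item = ET.SubElement(channel, "item")
        ET.SubElement(item, "title").text = title
        ET.SubElement(item, "link").text = link
        ET.SubElement(item, "pubDate").text = format_datetime(published)

    ET.ElementTree(rss).write(path, encoding="utf-8", xml_declaration=True)

build_rss([("Example headline", "https://example.com/news/1",
            datetime(2024, 1, 1, tzinfo=timezone.utc))])
```

The resulting feed.xml could then be served from a public bucket or a small web endpoint on whichever cloud host is chosen.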
-Recommend the best affordable cloud service to host the script, such as Amazon (AWS) or Google Cloud; we will sign up for it and add you as a user. Set up the service as needed and configure the scripts to run automatically on a schedule.
-At the end of the project, we would have full access to the code and would maintain the tool ourselves, although availability for potential future jobs to update the tool would be beneficial.