1001 Freelance Projects
Latest Projects from
Freelance Marketplaces
View Project
View this project in detail
(Note: you will be redirected to external marketplace)
Project title:
Data scraping RSS & Automation
Posted by:
External project from Upwork
Started:
12-Aug-2024 18:12 GMT
Description:
Hi,


There are 3 projects, which are complementary but independent (but the project 3 will be carried out if projects 1 and 2 are successful)


This involves working on a job offer flow: either to create another flow, or to integrate the flow into an excel spreadsheet.



PROJECT 1 :

- Create a new TARGET RSS feed based on one (or 2) existing SOURCE RSS feeds, but filtered on the exact criteria requested (criteria to be communicated). The SOURCE RSS feed (a series of job postings constantly updated in real time)

- Format the HTML description part of the TARGET RSS feed (using the formatting of a SOURCE RSS feed) to enable the customer to send a formatted e-mail every day (this part of the e-mail is just for IT purposes to explain the need) via Microsoft's powerautomate.com tool (the customer's environment is Microsoft).

- Development to provide a URL like: https://domaine.fr/fa/rss

- Back and forth for correction/validation

- Start-up, hosting on the customer's server corresponding to his technical environment (question to ask yourself, for example, at the start of the project: which server do I need to store the development? and can it run on the customer's environment?)


See specifications in attach files : Specifications_Feed 1_En_vFinale.pdf



PROJECT 2 :

- Scraping of email addresses (+ employer field) contained in the following SOURCE RSS feed : https://www.emploi-territorial.fr/rss?


(this SOURCE RSS feed is a series of job advertisement publications constantly updated in real time).


This scraping is used to insert the e-mail addresses into a Microsoft Excel spreadsheet stored on the customer's Sharepoint.


- Technical constraints: automatically update a Microsoft Excel spreadsheet online in the Microsoft environment from a URL pointing to the file, using the customer's SharePoint API keys to write to the target file.


- Take into account the following processing after scraping in Excel :

- Structure the file in Tab 1 with 4 columns: e-mail (column A), e-mail identifier before @ (column B), e-mail domain with @ in front (column C), employer field information in the feed (column D).

- Cleans up duplicates in column A with the list of emails created each time the excel file is completed.

- Update frequency: every 6 hours (starting at 8am - Paris time - for the first cycle, then every 6 hours). Cumulate information already in the file with new information (cumulative update)

- A tab 2 contains in column A a manually updated dynamic list of e-mail domains (with the @ in front): delete in tab 1 all lines whose column C corresponds to an e-mail domain in column A of tab 2.

- Correction/validation back and forth

- Start-up, hosting on the customer's server corresponding to his technical environment (question to ask yourself, for example, at the start of the project: which server do I need to store the development? so that it can run on the customer's environment?)


See specifications in attach files : Specifications_Feed 2_En_vFinale.pdf


PROJECT 3 :

- Scraping of 4 data items contained in one (or 2) existing RSS SOURCE feeds, but filtered according to the exact criteria requested (exclusion criteria to be communicated by the customer: criteria identical to the "Project 1" requirements described above).

(this RSS SOURCE feed is a series of job postings constantly updated in real time). This scraping is used to insert the data into a Microsoft Excel spreadsheet stored on the customer's Sharepoint.


- Technical constraints: automatically update a Microsoft Excel spreadsheet online in the Microsoft environment from a URL pointing to the file, using the customer's SharePoint API keys to write to the target file.


- Take into account the following processing after scraping in Excel :

- Structure the file with 13 columns: publication date (column A), employer name (column E), position name (column F), e-mail (column G), position URL (column H) - the other empty columns (at the head of the file in line 1, not to be touched).

- Update frequency: automatic every 24 hours (from 8 a.m. Paris time). Cumulate information already in the spreadsheet with new information (cumulative update)

- Correction/validation back and forth

- Start-up, hosting on the customer's server corresponding to his technical environment (question to ask yourself, for example, at the start of the project: which server is needed to store the development? and can it run on the customer's environment?)


See specifications in attach files : Specifications_Feed 3_En_vFinale.pdf


We'll take the time to explain and answer your questions!

Thank you

Hourly Range: $10.00-$40.00


Project ID:
3432682
Project category:
Data Scraping, Data Extraction, API Integration, Microsoft Excel, RSS, Developmental Editing, Automation
Project budget:
View this project in detail
(Note: you will be redirected to external marketplace)
Last Projects / Browse Projects
  Project Started
Digital Marketing Pro for Music Brand
Category: Advertising, Digital Marketing, Facebook Ads, Facebook Marketing, Internet Marketing, PPC Marketing, Social Media Management, Social Media Marketing
Budget: $15 - $25 USD
08 Jul 2025 10:04 GMT
Real-Time Person Detection System with YOLO
Category: C++, Programming, Computer Vision, Deep Learning, Image Processing, Object Detection, OpenCV, Python, Software Architecture, Video Processing, YOLO
Budget: ₹12500 - ₹37500 INR
08 Jul 2025 10:02 GMT
Django Site Deployment on AWS
Category: Amazon Web Services, Cloud Computing, Django, Linux, Python, SSL, Web Development, Web Hosting
Budget: ₹600 - ₹1500 INR
08 Jul 2025 10:02 GMT
Smart Home Brand Visual Identity Design
Category: Adobe Illustrator, Photoshop, Branding, Graphic Design, Icon Design, Logo Design
Budget: $30 - $250 USD
08 Jul 2025 10:02 GMT
3D Product Design Specialist Needed
Category: 3D Animation, 3D Design, 3D Modelling, 3D Printing, 3D Rendering, 3D Visualization, 3ds Max, Blender, Product Design
Budget: $30 - $250 USD
08 Jul 2025 10:00 GMT
Non-WordPress to WordPress Website Conversion
Category: CSS, Elementor, HTML, JavaScript, PHP, Web Hosting, Web Development, WordPress
Budget: ₹600 - ₹1500 INR
08 Jul 2025 10:00 GMT
Shopify Pricing Integration Fix
Category: API Integration, JavaScript, PHP, Shopify, Shopify Development, Shopify Templates, Web Application, Web Development
Budget: ₹600 - ₹1500 INR
08 Jul 2025 09:59 GMT
High school project
Category: English Translation, Mathematics
Budget: ₹12500 - ₹37500 INR
08 Jul 2025 09:59 GMT
Professional Corporate Brochure Design
Category: Branding, Brochure Design, Content Writing, Corporate Identity, Digital Design, Graphic Design, Marketing, Photoshop, Print Design, Visual Design
Budget: ₹1500 - ₹12500 INR
08 Jul 2025 09:58 GMT
Redesigning Ziplofy for a Modern UI/UX
Category: CSS, Figma, JavaScript, PHP, UI / User Interface, UX / User Experience, Web Design
Budget: ₹1500 - ₹12500 INR
08 Jul 2025 09:58 GMT
Web-Based Inventory Management System
Category: API Development, Backend Development, Database Management, Java, MySQL, PHP, PostgreSQL, Software Architecture, Web Development, Web Design
Budget: $30 - $250 USD
08 Jul 2025 09:58 GMT
Minor PDF Editing
Category: Content Writing, Document Checking, Editing, PDF, Photoshop, Proofreading, Word
Budget: ₹2500 - ₹5000 INR
08 Jul 2025 09:58 GMT
Migrate Website Builder to WordPress + SEO Setup
Category: HTML, Internet Marketing, PHP, SEO, WordPress
Budget: ₹2500 - ₹5000 INR
08 Jul 2025 09:57 GMT
React & Next.js ERP Developer Needed
Category: Frontend Development, HTML5, Java, JavaScript, Next.js
Budget: ₹37500 - ₹75000 INR
08 Jul 2025 09:56 GMT
Modern Restaurant Design for a hotel
Category: 3D Modelling, 3D Rendering, Architecture, Building Architecture, Building Design, Building Engineering, Design, Design Thinking, Graphic Design, Interior Design
Budget: $30 - $250 USD
08 Jul 2025 09:54 GMT
Browse All Projects
Projects by Skills ...
android
ajax
asp
aspnet
cms
cpp
csharp
css
delphi
design
drupal
excel
facebook
flash
html
java
javascript
joomla
iphone
mysql
photoshop
php
python
ruby
seo
sql
sysadm
translate
typing
twitter
vbnet
xml
wordpress
writing
New!
Проекты на русском
(Projects in Russian)

Copyright © 2005-2024
1001 Freelance Projects