1001 Freelance Projects
Latest Projects from
Freelance Marketplaces
View Project
View this project in detail
(Note: you will be redirected to external marketplace)
Project title:
serpapi scholar ai websearch equviliant
Posted by:
External project from PeoplePerHour
Started:
25-Mar-2025 18:40 GMT
Description:
Original Project Overview
We maintain a Google Sheets database of academic figures, each with the following columns:

Name
Surname
Email
University
Subject

We need to automatically fetch the latest two publications for each individual using:
Google Scholar (via SerpAPI’s Scholar API)
ORCID (public API)
OpenAlex (public API)

The results must be inserted back into specified columns in the Google Sheet (F–Q), updated daily or on-demand.

Key Challenges
Sorting by Latest in Google Scholar
By default, Scholar returns top-cited or most-relevant articles, not necessarily newest.
Must force a date sort (&sort=pubdate) or a separate fallback approach.
Handling Ambiguous Names
Multiple authors can share the same name.
Potentially need to detect and filter out mismatched results.
Ensuring Valid Data Types
Google Sheets rejects structured or non-string data.
Must parse out just the string or numeric year/date.
Rate Limits and API Keys
Respect daily or per-minute call limits for SerpAPI, ORCID, OpenAlex.
Must handle errors gracefully.

Automation
Should run as a script from a server or cloud environment (we have console access).
Possibly on a schedule (e.g., daily cron job).

Responsibilities
Set up a robust Python script or application that:
Authenticates to Google Sheets (using service account credentials). DONE
Retrieves our researcher data (Name, Surname, Email, University, Subject) from the “Pubs” or “Topics” sheet.
Calls SerpAPI’s Google Scholar endpoint with date-sorted queries, including fallback logic if no results appear.
Calls ORCID’s public API (optional, can parse structured date fields carefully).
Calls OpenAlex’s public API to fetch the newest two works by each author.
Inserts the results (title, year/date) back into the correct columns in the Google Sheet (F–Q).
Handle name disambiguation or minimal validation so we don’t store obviously incorrect publications.

Deliverables
Python script (or small app) that can be triggered manually or on a schedule.
Successful write-back of 2 publications each from Google Scholar, ORCID, and OpenAlex (6 total) into the correct columns.
Error handling: If no results, store “N/A” or a sensible fallback.
Instructions on setup, environment variables, and usage (e.g., README).




What Still Needs To Be Done
1. Fix Google Scholar Fetch
Main Problem: You're getting most cited or incorrect results.

Needs Fixing:

Ensure sort=pubdate is actually respected in SerpAPI query.

Add fallback logic: if pubdate query fails (returns nothing), retry without it.

Clean up irrelevant results using basic validation (e.g., check university or subject in result string).

2. OpenAlex Improvements
Currently works for some authors.
But still needs name disambiguation logic:

Validate author’s affiliation or match on email (if possible).

Fallback if OpenAlex returns wrong author.

3. ORCID Integration (Optional)
You paused this due to structured date errors.

If you want to include it again later:

Parse structured publication-date fields into simple strings.

Validate the correct ORCID author before fetching works.

4. Error Handling
Improve logging to show:

Which step failed (e.g. API call, parsing, writing to sheet).

Show why (e.g., "no publications found", "mismatched data", "bad year format").

Replace all "N/A" logic with something more transparent (e.g., "No match found for [X]").

5. Name Disambiguation Logic
Currently just using Name + University + Subject in the query.

Needs:

Smarter matching logic to avoid irrelevant or repetitive publications.

Maybe strip long names or fuzzy-match affiliations if exact match fails.

6. Cleaner Sheet Writes
Fix Google Sheets writing errors:

Avoid trying to write structured JSON (like ORCID publication-date objects).

Ensure all fields written are flat strings (title + year/date only).

7. Automation (Optional)
Set up to run automatically (cron job, server scheduler) if needed.

Add logging/output to file for record-keeping.
Project ID:
3427941
Project category:
Project budget:
View this project in detail
(Note: you will be redirected to external marketplace)
Last Projects / Browse Projects
  Project Started
Modern Shopify Site Development -- 2
Category: HTML, Mobile App Development, SEO, Shopify, Shopify Templates, Web Design, Web Development
Budget: ₹12500 - ₹37500 INR
10 Sep 2025 04:04 GMT
Photo Edit: Fix Slouching & Add Person
Category: Background Removal, Graphic Design, Image Consultation, Image Processing, Photo Editing, Photo Retouching, Photoshop, Photoshop Design
Budget: ₹600 - ₹1500 INR
10 Sep 2025 04:04 GMT
Instagram Graphics Design Refresh
Category: Adobe Illustrator, Photoshop, Banner Design, Branding, Graphic Design, Illustration, Logo Design, Real Estate, Social Media Management
Budget: ₹100 - ₹400 INR
10 Sep 2025 04:02 GMT
Office Landscape and Fountain Design
Category: 3D Modelling, 3D Rendering, Architecture, Environmental Consulting, Gardening, Graphic Design, Interior Design, Landscape Design
Budget: ₹600 - ₹1500 INR
10 Sep 2025 04:02 GMT
Signage Production for Customer Awareness
Category: Corporate Identity, Graphic Design, Illustration, Logo Design, Photoshop
Budget: $17.5 - $35 USD
10 Sep 2025 04:01 GMT
LLM-Powered SaaS Web App Development
Category: AI Development, GitHub, LLM Integration, Next.js, Python, React.js, SaaS, Software Architecture, Vercel, Web Development
Budget: $30 - $250 USD
10 Sep 2025 03:59 GMT
Papiacumi Design Recreation
Category: Adobe Illustrator, Photoshop, Graphic Design, Image Processing, Logo Design, Photoshop Design, Typography
Budget: $10 - $30 USD
10 Sep 2025 03:59 GMT
Planos Arquitectónicos en AutoCAD
Category: 3D Modelling, 3D Rendering, AutoCAD, Building Architecture, CAD / CAM
Budget: $30 - $250 USD
10 Sep 2025 03:59 GMT
Full Stack AI Tool Development
Category: Django, Full Stack Development, GitHub, Mobile App Development, Natural Language Processing, Python, React.js, UI / User Interface, Web Development
Budget: $10 - $30 USD
10 Sep 2025 03:56 GMT
Secure Fishery Data Exchange
Category: API Development, Data Analytics, JavaScript, MySQL, PHP, Security, Software Architecture, Web Development
Budget: $250 - $750 USD
10 Sep 2025 03:56 GMT
Vintage Logo Design Needed
Category: Adobe Illustrator, Branding, Graphic Design, Illustration, Logo Design, Photoshop, Print Design, Vector Design
Budget: $2 - $8 USD
10 Sep 2025 03:55 GMT
Editable A3 Event Flyer Redesign
Category: Canva, Flyer Design, Graphic Design, Photoshop, Print Design
Budget: $10 - $30 USD
10 Sep 2025 03:55 GMT
Modern Informative Website Development
Category: CSS, Graphic Design, HTML, JavaScript, PHP, Squarespace, Web Design, Web Development
Budget: $250 - $750 USD
10 Sep 2025 03:54 GMT
Partial MongoDB Data Recovery
Category: Data Extraction, Data Management, Elasticsearch, JSON, MongoDB, MySQL, Node.js, NoSQL Couch & Mongo
Budget: ₹1500 - ₹12500 INR
10 Sep 2025 03:54 GMT
Seasoned Video Editor and Reels Specialist
Category: Adobe Premiere Pro, After Effects, CapCut, Final Cut Pro, Sound Design, Video Editing, Video Production, Videography
Budget: ₹1500 - ₹12500 INR
10 Sep 2025 03:53 GMT
Browse All Projects
Projects by Skills ...
android
ajax
asp
aspnet
cms
cpp
csharp
css
delphi
design
drupal
excel
facebook
flash
html
java
javascript
joomla
iphone
mysql
photoshop
php
python
ruby
seo
sql
sysadm
translate
typing
twitter
vbnet
xml
wordpress
writing
New!
Проекты на русском
(Projects in Russian)

Copyright © 2005-2024
1001 Freelance Projects