Automated academic paper metadata & variable extraction with Gemini to Google Sheets
Description
In the social and behavioral sciences (e.g., psychology, sociology, economics, management), researchers and students often need to normalize academic paper metadata and extract variables before starting a literature review or meta-analysis.
This workflow automates the busywork. Using an LLM, it processes CSV/XLSX/XLS files (exported from WoS, Scopus, EndNote, Zotero, or your own spreadsheets) into normalized metadata and extracted variables, and writes a neat table to Google Sheets.
Example Google Sheet: click me
Who is this for?
- Undergraduate and graduate students or researchers in soft-science fields (psychology, sociology, economics, business)
- People who don't have time to read full papers and need quick overviews
- Anyone who wants to automate academic paper metadata normalization and variable extraction to speed up a literature review
How it works
- Upload an academic paper file (CSV/XLSX/XLS) in chat.
- The workflow creates a Google Sheets spreadsheet with two tabs: Checkpoint and FinalResult.
- A structured-output LLM normalizes core metadata (title, abstract, authors, publication date, source) from the uploaded file and writes it to Checkpoint; a Gmail notification is sent when finished.
- A second structured-output LLM uses the metadata above to extract variables (Independent Variable, Dependent Variable) and writes them to FinalResult; you'll get a second Gmail notification when done.
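The two record shapes described above can be sketched in Python. This is only an illustration of what one row in each tab might look like, not the workflow's actual schema: the class names, field names, and the input column names (`TI`, `AU`, `AB`, `PY`, `SO`, and so on) are assumptions, and `normalize_row` is a deterministic stand-in for the first LLM step, which in the real workflow is handled by the model and its Structured Output Parser.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class PaperMetadata:
    """One normalized row, as it might appear in the Checkpoint tab (assumed shape)."""
    title: str
    abstract: str
    authors: List[str]
    publication_date: str
    source: str


@dataclass
class ExtractedVariables:
    """One row of extracted variables, as it might appear in FinalResult (assumed shape)."""
    title: str
    independent_variables: List[str] = field(default_factory=list)
    dependent_variables: List[str] = field(default_factory=list)


def normalize_row(raw: dict) -> PaperMetadata:
    """Map differently named export columns onto one fixed set of fields.

    The candidate column names below are hypothetical examples.
    """
    def pick(*keys: str) -> str:
        # Return the first non-empty value among the candidate column names.
        for key in keys:
            value = raw.get(key)
            if value:
                return str(value).strip()
        return ""

    authors_raw = pick("Authors", "Author", "AU")
    return PaperMetadata(
        title=pick("Title", "TI"),
        abstract=pick("Abstract", "AB"),
        authors=[a.strip() for a in authors_raw.split(";") if a.strip()],
        publication_date=pick("Year", "Publication Year", "PY"),
        source=pick("Source", "Publication Title", "SO"),
    )


# Example: a row with abbreviated column names becomes a fixed-shape record.
example = normalize_row(
    {"TI": " A study of X ", "AU": "Doe, J.; Roe, R.", "AB": "We test X.", "PY": 2020, "SO": "Example Journal"}
)
variables = ExtractedVariables(title=example.title, independent_variables=["X"])
```

Keeping both stages on fixed shapes like these is what lets each stage append plain rows to its own tab.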
How to set up
Credentials
- Google Sheets OAuth2 (read/write)
- Gmail OAuth2 (send notifications)
- Google Gemini (or any LLM you prefer)
Quick start
- Connect Google Sheets, Gmail, and Gemini (or your LLM) credentials.
- Open File Upload Trigger → upload your CSV/XLSX/XLS file and type a name in chat (used as the Google Sheets spreadsheet title).
- Watch your inbox for status emails and open the Google Sheets spreadsheet to review Checkpoint and FinalResult.
Customization
- Journal lists: Edit the Journal Rank Classifier code node to add/remove titles. The default list is for business/management journals; swap it for a list from your own field.
- Notifications: Replace Gmail with Slack, Teams, or any channel you prefer.
- LLM outputs: Need different metadata or extracted data? Edit the LLM's system prompt and Structured Output Parser.
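For the journal list customization, the general idea of a title-based rank lookup can be sketched as follows. This is not the code in the Journal Rank Classifier node; the tier labels, the placeholder journal titles, and the function names are all hypothetical, and the sketch only shows one way a swapped-in list could be matched despite differences in case and punctuation.

```python
import re

# Hypothetical tiers and placeholder titles; replace with a list from your own field.
JOURNAL_TIERS = {
    "A": ["Example Journal of Psychology", "Example Sociological Review"],
    "B": ["Example Quarterly of Economics"],
}


def normalize_title(name: str) -> str:
    """Lowercase, spell out '&', drop punctuation, and collapse whitespace."""
    name = name.lower().replace("&", " and ")
    name = re.sub(r"[^a-z0-9 ]+", " ", name)
    return " ".join(name.split())


# Build the lookup once so each classification is a single dictionary access.
LOOKUP = {
    normalize_title(title): tier
    for tier, titles in JOURNAL_TIERS.items()
    for title in titles
}


def classify_journal(source: str, default: str = "Unranked") -> str:
    """Return the tier for a journal title, or the default when it is not listed."""
    return LOOKUP.get(normalize_title(source), default)
```

Normalizing both the list and the incoming title means an export that writes a journal name in upper case or with a trailing period still matches the same entry.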
Note
- Make sure your file includes abstracts. If the academic paper data you upload doesn't contain an abstract, the extracted results will be far less useful.
- CSV yields no items? Encoding mismatches can break the workflow. If this happens, convert the CSV to .xls or .xlsx and try again.
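To check whether encoding is the likely culprit before converting, a small local script can try a few encodings on the raw CSV bytes. This is a minimal sketch outside the workflow itself; the candidate encodings and their order are assumptions, and a wrong guess can still decode without error while producing the wrong characters, so the result is worth inspecting by eye.

```python
import csv
import io

# Assumed candidates: UTF-8 (with or without BOM) first, then common Western fallbacks.
CANDIDATE_ENCODINGS = ("utf-8-sig", "cp1252", "latin-1")


def decode_csv_bytes(data: bytes, encodings=CANDIDATE_ENCODINGS):
    """Return (text, encoding) for the first candidate that decodes without error."""
    for encoding in encodings:
        try:
            return data.decode(encoding), encoding
        except UnicodeDecodeError:
            continue
    raise ValueError("none of the candidate encodings could decode the data")


def read_rows(data: bytes):
    """Parse the decoded CSV text into a list of dictionaries keyed by header."""
    text, _ = decode_csv_bytes(data)
    return list(csv.DictReader(io.StringIO(text, newline="")))
```

If `decode_csv_bytes` reports something other than `utf-8-sig`, the file is probably not UTF-8, which would be consistent with the encoding mismatch described above.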
Help
Contact: owenlzyxg@gmail.com
Nodes Used
Google Sheets, Execute Sub-workflow, Gmail, Execute Workflow Trigger, AI Agent, Auto-fixing Output Parser
Import
Download workflow.json and import into n8n:
Workflow menu → Import from File