🎬 Build a weekly AI trend alerter with arXiv and Weaviate

⚡ 12,300 views · 🎬 Content Creation & Video

Description

Build a Weekly AI Trend Alerter with arXiv and Weaviate

Ditch the endless scroll for AI trends. Meet Archi, your personal AI research assistant that hits you up once a week with everything you need to know. 🧑🏽‍🔬

This workflow scrapes AI and machine learning article abstracts from arXiv, enriches them with topic categories using an LLM, and embeds them in a Weaviate vector store. The vector store is then used as a tool for agentic RAG to write a concise, easy-to-read summary of the week in AI research.

The final output is a short, weekly email sent to the address of your choice that summarizes key AI research trends and future research directions, with links directly to the most interesting and impactful arXiv papers of the week.

Who it's for

This workflow is for anyone who can't keep up with all the latest AI advances. Coding skills are not required.

How it works

The workflow runs end to end in two main parts: a data pipeline that fetches articles and embeds them in Weaviate, and an agentic workflow that generates the weekly email summary.

Part 1: Automatically fetch newly published articles on a weekly basis
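
To make Part 1 more concrete, here is a minimal Python sketch of the kind of request and parsing the HTTP Request step performs. It is not the workflow's actual configuration: the category list (`cs.AI`, `cs.LG`), the `max_results` value, and the function names `build_query_url` and `parse_feed` are assumptions for illustration, and `SAMPLE_FEED` is a hand-written placeholder rather than real arXiv data. It relies on the public arXiv API, which returns an Atom feed.

```python
import urllib.parse
import xml.etree.ElementTree as ET

ARXIV_API = "http://export.arxiv.org/api/query"
ATOM = {"atom": "http://www.w3.org/2005/Atom"}

# Hand-written placeholder feed (not real arXiv data), shaped like an Atom response.
SAMPLE_FEED = """<feed xmlns="http://www.w3.org/2005/Atom">
  <entry>
    <id>http://arxiv.org/abs/0000.00000v1</id>
    <published>2025-01-06T00:00:00Z</published>
    <title>A Placeholder
      Paper Title</title>
    <summary>  Placeholder abstract text.  </summary>
  </entry>
</feed>"""


def build_query_url(categories, max_results=100):
    """Build an arXiv API URL for the newest papers in the given categories."""
    search = " OR ".join(f"cat:{c}" for c in categories)
    params = {
        "search_query": search,
        "sortBy": "submittedDate",
        "sortOrder": "descending",
        "max_results": max_results,
    }
    return f"{ARXIV_API}?{urllib.parse.urlencode(params)}"


def clean(text):
    """Collapse the line breaks and extra spaces arXiv leaves in titles and abstracts."""
    return " ".join(text.split())


def parse_feed(xml_text):
    """Extract title, abstract, URL, and publish date from an Atom feed."""
    root = ET.fromstring(xml_text)
    papers = []
    for entry in root.findall("atom:entry", ATOM):
        papers.append({
            "title": clean(entry.findtext("atom:title", default="", namespaces=ATOM)),
            "abstract": clean(entry.findtext("atom:summary", default="", namespaces=ATOM)),
            "url": clean(entry.findtext("atom:id", default="", namespaces=ATOM)),
            "published": clean(entry.findtext("atom:published", default="", namespaces=ATOM)),
        })
    return papers


if __name__ == "__main__":
    print(build_query_url(["cs.AI", "cs.LG"], max_results=5))
    print(parse_feed(SAMPLE_FEED))
```

In the workflow itself, the Schedule Trigger node takes care of running this step once a week, so no date handling is shown here.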

Part 2: Use an AI Agent and Weaviate to generate a weekly summary email
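
The retrieval idea behind Part 2 can be sketched in plain Python. This is a toy stand-in, not how Weaviate or the AI Agent node is implemented: `embed` uses word counts instead of a real embedding model, the in-memory list replaces the vector store, and `index`, `search`, and `build_prompt` are hypothetical names. It only illustrates the pattern of retrieving the most relevant abstracts and handing them to a chat model as context.

```python
import math
from collections import Counter


def embed(text):
    """Toy bag-of-words 'embedding'; a real setup would call an embedding model."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def index(papers):
    """Attach a vector to each paper; stands in for writing to the vector store."""
    return [{**p, "vector": embed(p["abstract"])} for p in papers]


def search(store, query, k=2):
    """Return the k stored papers most similar to the query (the agent's 'tool')."""
    q = embed(query)
    ranked = sorted(store, key=lambda p: cosine(q, p["vector"]), reverse=True)
    return ranked[:k]


def build_prompt(topic, papers):
    """Assemble retrieved abstracts into a prompt for a chat model."""
    context = "\n".join(f"- {p['title']}: {p['abstract']}" for p in papers)
    return f"Summarize this week's research on {topic}.\n\nPapers:\n{context}"


if __name__ == "__main__":
    # Made-up example records, for illustration only.
    store = index([
        {"title": "Paper A", "abstract": "retrieval augmented generation for question answering"},
        {"title": "Paper B", "abstract": "foundation models for tabular data"},
    ])
    hits = search(store, "retrieval augmented generation", k=1)
    print(build_prompt("RAG systems", hits))
```

An agent differs from a fixed pipeline in that the model decides when and how often to call `search`, and with which queries, before it writes the summary.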

Prerequisites

  1. An existing Weaviate cluster. You can view instructions for setting up a local cluster with Docker here or a Weaviate Cloud cluster here.
  2. API keys to generate embeddings and power chat models. We use a combination of OpenRouter and OpenAI models. Feel free to switch out the models as you like.
  3. An email address with SMTP privileges. This is the address the email will come from. In this demo we use a personal Gmail address. You can create a new credential to link an SMTP account using these instructions.
  4. A self-hosted n8n instance. See this video for how to get set up in just three minutes.
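
For readers curious about what the SMTP credential in item 3 is used for, the sketch below builds and sends a plain-text email with Python's standard library. It is an illustration only, since the workflow does this through the Send Email node without any code. The function names, the subject line, and every address, host, port, and credential are placeholders you would replace with your own provider's values.

```python
import smtplib
from email.message import EmailMessage


def build_summary_email(sender, recipient, trends, papers):
    """Compose a plain-text weekly summary; papers is a list of (title, url) pairs."""
    msg = EmailMessage()
    msg["From"] = sender
    msg["To"] = recipient
    msg["Subject"] = "Weekly AI research trends"
    lines = ["Hey there,", "", trends, "", "Papers worth a look:"]
    lines += [f"- {title}: {url}" for title, url in papers]
    lines += ["", "Until next week,", "Archi"]
    msg.set_content("\n".join(lines))
    return msg


def send_email(msg, host, port, username, password):
    """Send over SMTP with STARTTLS; host, port, and credentials are placeholders."""
    with smtplib.SMTP(host, port) as server:
        server.starttls()
        server.login(username, password)
        server.send_message(msg)


if __name__ == "__main__":
    message = build_summary_email(
        "sender@example.com",
        "reader@example.com",
        "Placeholder summary of this week's trends.",
        [("Placeholder paper", "http://arxiv.org/abs/0000.00000")],
    )
    print(message)
    # send_email(message, "smtp.example.com", 587, "user", "app-password")
```

Many providers require an app-specific password rather than your normal login for SMTP access, so check your provider's documentation before filling in the credential.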

How to run the workflow

  1. Go through the prerequisites: create a Weaviate cluster (local or cloud), install self-hosted n8n, set up SMTP privileges for your email account, and add your API keys and other credentials.
  2. Select the embedding and chat models you'd like to use.
  3. Enter the email addresses you want to send the email from and to.
  4. Let it rip.

Workflow output

The output for this workflow is a weekly email that summarizes key research trends and future research directions based on AI and ML papers published on arXiv.

Here's an example of a summary email:

Hey there,

Here's a quick rundown of the key trends in Machine Learning research from the past week.

💫 Key Research Trends This Week

This week saw significant advancements in retrieval-augmented systems, foundation models for specialized domains, and techniques balancing efficiency with performance.

🔮 Future Research Directions

Based on current trends, we expect to see the following developments in the near future:

This week highlights how researchers are making AI more efficient, explainable, and applicable to specialized domains. Look out for more developments in RAG systems, tabular foundation models, and privacy-preserving AI techniques in the coming weeks.

Until next week,

Archi 🧑🏽‍🔬

Want to make it better?

Feel free to tweak, build on, or completely reconfigure this workflow. If you come up with something cool, let us know and we might just share it with our community! 💚

🔗 Nodes Used

Send Email, HTTP Request, Markdown, Schedule Trigger, AI Agent, Embeddings OpenAI

📥 Import

Download workflow.json and import into n8n: Workflow menu → Import from File
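
If you want a quick look at what a workflow file contains before importing it, a short script can list its node types. This is a sketch under assumptions: it expects the export to have a top-level `nodes` list whose entries carry a `type` field, and the `SAMPLE_WORKFLOW` string is a hand-written stand-in, not the contents of the real workflow.json. The name `count_node_types` is hypothetical.

```python
import json
from collections import Counter

# Hand-written stand-in for an exported workflow; the real file will differ.
SAMPLE_WORKFLOW = """{
  "name": "Example",
  "nodes": [
    {"name": "Weekly trigger", "type": "n8n-nodes-base.scheduleTrigger"},
    {"name": "Fetch arXiv", "type": "n8n-nodes-base.httpRequest"},
    {"name": "Fetch more", "type": "n8n-nodes-base.httpRequest"}
  ],
  "connections": {}
}"""


def count_node_types(workflow_json):
    """Count node types in an exported workflow (assumes a top-level "nodes" list)."""
    workflow = json.loads(workflow_json)
    nodes = workflow.get("nodes", [])
    return Counter(node.get("type", "unknown") for node in nodes)


if __name__ == "__main__":
    for node_type, count in count_node_types(SAMPLE_WORKFLOW).most_common():
        print(f"{count} x {node_type}")
```

To run it against the downloaded file, read the file's text with `open("workflow.json", encoding="utf-8").read()` and pass that to `count_node_types`.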

📖 Importing guide · 🔑 Credential setup