Fix Source Ingestion: Deduplicate and Relevance-Filter YouTube Inputs
The input set contains the same YouTube video twice and content unrelated to backend/AI-in-SDLC, exposing gaps in our ingestion pipeline. Add deterministic deduplication by YouTube videoId and a lightweight relevance classifier on titles/descriptions to filter off-topic items. This reduces noise bef...