Soumendra kumar sahoo
Soumendra's Blog

Soumendra's Blog

Series

Odia Language

All work related to Odia language are described over here.

Articles in this series

Pinned article

Scrape News website using Scrapy

Dec 4, 20217 min read 133 views

Abstract To feed data-hungry NLP models of recent times, I have scraped 5,50,000+ news articles from their websites which constitute an average of 50 lakhs sentences and 2 crores plus words of Monolingual Odia corpus. The dataset consists of a header...

Scrape News website using Scrapy
Random Odia Name Generator
Odia language detection
Increase traffic flows to your Facebook page organically
Extracting Parallel-text pairs from Wikipedia
Statistics about Odia Wikipedia