--- title: Pashto Datasets description: Curated Pashto datasets for ASR, TTS, NLP, MT, and language technology benchmarking. keywords: pashto datasets, pukhto dataset, pushto data, pashto nlp resources --- # Pashto Datasets This page is for people searching for Pashto datasets across speech and text tasks. ## Start Here - Search all resources: [Pashto resource search](search/index.html) - Dataset index: [resources/datasets/README.md](../resources/datasets/README.md) - Catalog overview: [resource_catalog.md](resource_catalog.md) ## Dataset Coverage - Speech datasets for ASR and TTS. - Text corpora for NLP and MT. - Benchmark-ready subsets and metadata references. ## Related Intent Pages - [Pashto ASR resources](pashto_asr.md) - [Pashto TTS resources](pashto_tts.md) ## Contribution To add a dataset, follow [dataset_guidelines.md](dataset_guidelines.md) and submit a PR with evidence, license, and task tags.