New multilingual severity classifiers for vulnerability analysis

:rocket: We’ve just published a new article introducing a Russian-language severity classifier, along with improved English and Chinese models for vulnerability descriptions.

:backhand_index_pointing_right: New Russian Severity Classifier and Improved Multilingual Models – Vulnerability-Lookup

These models are trained with VulnTrain and served via ML-Gateway, and are fully integrated into Vulnerability-Lookup.

:magnifying_glass_tilted_left: What’s new

  • :russia: Russian severity classifier based on FSTEC (BDU) data, using a ruRoBERTa-large model
  • :united_kingdom: Improved English model trained on diverse sources (CVE, GitHub, PySec, CSAF…)
  • :china: Improved Chinese model leveraging CNVD data
  • :bar_chart: Better training insights with per-class precision / recall / F1 metrics
  • :puzzle_piece: Multi-source datasets with traceable origins and dynamic dataset cards

All datasets and models are openly available on Hugging Face.

:gear: Under the hood

  • VulnTrain 3.1.0 adds FSTEC support, dataset traceability, and improved model selection
  • ML-Gateway 0.5.0 now supports multilingual severity classification out of the box

This work is part of the AIPITCH project, supporting practical AI integration in cybersecurity workflows.

1 Like