New task/project [opendata]: parquet generator for new and existing datasets

During hackathon.lu, I’ll be working on designing a tool to facilitate the generation of Parquet files.

The main source for testing will be the CIRCL Passive SSH database, which will be used to generate a new open data set containing all the scanned SSH key materials.

Different libraries will be tested and evaluated to simply the process.