OpenAlex · Aktualisierung stündlich · Letzte Aktualisierung: 23.03.2026, 16:48

Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.

Replication Package: Vulnerably (Mis)Configured? Exploring 10 Years of Developers' Q&As on Stack Overflow

2023·0 Zitationen·Zenodo (CERN European Organization for Nuclear Research)Open Access
Volltext beim Verlag öffnen

0

Zitationen

1

Autoren

2023

Jahr

Abstract

Welcome to the public repository for the additional content of the paper "Vulnerably (Mis)Configured? Exploring 10 Years of Developers' Q&As on Stack Overflow", accepted at the International Working Conference on Variability Modelling of Software-Intensive Systems (VAMOS) 2024. This repository provides additional information to the conducted exploratory study on configuration-related vulnerabilities, including the following files: README.txt LICENSE.txt DATASET_CONFIG_VULN_SO.csv: sheet containing data of 651 StackOverflow posts, including additional classifications based on manual analyses and automatic topic modeling Instructions for using the dataset Download and open the dataset (platform-independent CSV file). The dataset includes 16 columns (A – P):- Columns A – J: Original data fetched from the BigQuery Stack Overflow dataset (Question_ID, Year_Asked, Question_Title, Question_Body, Question_Tags, View_Count, Question_Rating, Favorite_Count, Status, Answer_Count)- Columns K – N: Manually extracted data from the Stack Overflow posts (System, Configuration Context, Security Context, Topic)- Column O: Data based on the automated topic modeling (Configuration Topic)- Column P: Additional data extracted from the Stack Overflow posts without further classifications (Additional Comments) Requirements No requirements Further information The dataset is based on a search string (SQL query; August 1, 2023) applied on the Google BigQuery Stack Overflow dataset: ("secur*") AND ("vulnerabilit*" OR "weakness*" OR "breach*" OR "exposure*" OR "CVE*" OR "CWE*") AND ("config*") Originally, the dataset included 1,235 post which were limited by the first and second authors to 651 posts (34 deleted posts, 550 posts out of scope) using the following selection criteria: - The post has been created in the last decade (2013-2022).- The post is still available on the Stack Overflow website.- The post is directly connected to a vulnerability-related issue in the context of configuring. Topic modeling algorithm used: Latent Dirichlet Allocation (LDA)- Settings: 200 iterations (coherence value = 0.6 for k = 7 to 11), α = k, β = 0.01

Ähnliche Arbeiten

Autoren

Themen

Expert finding and Q&A systemsArtificial Intelligence in Healthcare and EducationScientific Computing and Data Management
Volltext beim Verlag öffnen