Does BERT understand code? - An exploratory study on the detection of architectural tactics in code (ECSA 2020 - Research Papers) - ECSA 2020 Software Architecture Conference

Who

Jan Keim, Angelika Kaplan, Anne Koziolek, Mehdi Mirakhorli

Track

ECSA 2020 Research Papers

When

Fri 18 Sep 2020 16:10 - 16:30 at ECSA 2020 Teams Channel - S13: Smells and Technical Debt (II) Chair(s): Xabier Larrucea, Gabriel A. Moreno

Abstract

Quality-driven design decisions are often addressed by using architectural tactics, which are reusable solution options for certain quality concerns. However, it is not sufficient to make good design decisions; the realization of these decisions in code must also be reviewed. As manual creation of traceability links from design decisions to code is costly, some approaches perform structural analyses to recover traceability links. However, architectural tactics are high-level solutions described in terms of roles and interactions, and there is a wide range of possibilities to implement each. Therefore, structural analyses yield only limited results. Transfer-learning approaches using language models like BERT are a recent trend in the field of natural language processing. These approaches yield state-of-the-art results for tasks like text classification. In this paper, we experiment with BERT and present an approach to detect architectural tactics in code by fine-tuning BERT. A 10-fold cross-validation shows promising results with an average F1-Score of 90%, which is on a par with state-of-the-art approaches. We additionally apply our approach to a case study, where its results show promising potential but fall behind the state of the art. Therefore, we discuss our approach and look at potential reasons as well as issues and downsides. Moreover, we present ideas for future work to improve such a transfer-learning approach.

Jan Keim

Karlsruhe Institute of Technology (KIT)

Germany

Angelika Kaplan

Karlsruhe Institute of Technology

Germany

Anne Koziolek

Karlsruhe Institute of Technology

Germany

Mehdi Mirakhorli

Rochester Institute of Technology

United States

Session Program

Fri 18 Sep
Displayed time zone: Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna

16:10 - 16:50	S13: Smells and Technical Debt (II), Research Papers / Doctoral Symposium, at ECSA 2020 Teams Channel. Chair(s): Xabier Larrucea (Tecnalia), Gabriel A. Moreno (Carnegie Mellon University). Virtualization support: Claudio Di Sipio

16:10 20m		Does BERT understand code? - An exploratory study on the detection of architectural tactics in code. Short paper, Research Track, Research Papers. Jan Keim, Karlsruhe Institute of Technology (KIT); Angelika Kaplan, Karlsruhe Institute of Technology; Anne Koziolek, Karlsruhe Institute of Technology; Mehdi Mirakhorli, Rochester Institute of Technology
16:30 20m		A Semiautomatic Approach to Identify Architectural Technical Debt from Heterogeneous Artifacts. Doctoral Symposium. Boris Rainiero Perez Gutierrez, University of Los Andes, Colombia