Programmers are identifiable by Coding-Style

Gepostet vor 2 Jahren, 30 Tagen in #Misc #Science #Tech #Coding #Language

Share: Twitter Facebook Mail

Dass Menschen anhand ihres Schreibstils identifizierbar sind, ist nichts neues. Vor zwei Jahren gab's einen Vortrag auf dem 29C3 über Sprachabdrücke in Underground-Märkten im Darkweb (aka Silkroad u.ä.) Jetzt hat eine neue Studie festgestellt, dass dasselbe Prinzip für Coding gilt und sich da längst nicht nur auf wiederverwendete Funktionen bezieht.

Researchers […] have developed a “code stylometry,” which uses natural language processing and machine learning to determine the authors of source code based on coding style. Their findings, which were recently published in the paper “De-anonymizing Programmers via Code Stylometry,” could be applicable to a wide of range of situations where determining the true author of a piece of code is important. For example, it could be used to help identify the author of malicious source code and to help resolve plagiarism and copyright disputes.

The authors based their code stylometry on traditional style features, such as layout (e.g., whitespace) and lexical attributes (e.g., counts of various types of tokens). Their real innovation, though, was in developing what they call “abstract syntax trees” which are similar to parse tree for sentences, and are derived from language-specific syntax and keywords. These trees capture a syntactic feature set which, the authors wrote, “was created to capture properties of coding style that are completely independent from writing style.”

IT World: CSI Computer Science: Your coding style can give you away (via /.)

./code--poetry

„This website displays a collection of twelve code poems, each written in the source code of a different programming language.…

Stupid Hacks from Stupid Hackathon Sweden

Vor ein paar Tagen ging der Stupid Hackathon Schweden zu Ende, hier eine Liste aller dort realisierten Projekte. Spontan mein…

RechtsLinks 17.2.2017: Fuckopoulos at Maher, PewDiePie is a racist SuperNazi from hell, Trump is 4chan

Pic: Lie Lie Land von Bambi First things first: Milo Fuckopoulos ist heute abend bei Bill Maher: „Yiannopoulos will be…

Cool Coding Sticker: I hate Programming I love Programming!

I can relate. Gibt's auf Redbubble für 3 Euro. (via Sara Soueidan)

Where Slang Comes From

Schönes Posting von Matt Daniels über 2016er Slang: Where Slang Comes From – Emerging slang, identified by searches for words’…

MuscularAnimeGirl[NSFW].jpg = IndustrialMusic.mp3

Cool Bug featuring „a muscular girl“.jpg, anime style („slighly NSFW“ [„it's the only sample I had“ – sure]) and Industrial…

RechtsLinks 12.2.2017: Seeßlen über die Sprache der Rechten und eine Tour der pro-authoritaristischen Neo-Reaktion

The Atlantic: Behind the Internet's Anti-Democracy Movement – White House chief strategist Steve Bannon is reportedly a reader of neoreactionary…

Semiotics of Pepe

PBS Idea Channel über die Semiotik von Pepe: „Feels several different ways, man“. Bücher-Wiki über die im Clip angesprochene Polysemie:…

21 Things that Trump says make a „Bing“-Sound

Bing! 21 things that President Trump says make a "bing" sound pic.twitter.com/7e1Smlwoai — VICE News (@vicenews) February 7, 2017

PostScript-Malware manipulates Print-Jobs

Fefe hat gestern ein interessantes Sicherheits-Leck verlinkt mit dem lapidaren Satz: „Benutzt hier jemand Postscript-Drucker?“ This post is about manipulating…

Why Germans Can Say Things No One Else Can

School of Life: „Backpfeifengesicht, a face that's begging to be slapped. Generosity towards others is key, but German is bracing…