Siamese: A Scalable Code Clone Search Engine

View the Project on GitHub siamesetool/siamese


Siamese (Scalable, incremental, and multi-representation) is a code clone search engine.

Why it’s good?

It works with multiple representations of source code to capture code similarity at different structural levels and mines token frequencies in a code corpus on-the-fly and automatically adjusts a query’s length to improve the search speed and accuracy. The tool is scalable to a corpus of hundreds million lines of code and return the results within seconds. It also allows incremental updates to its index to support changes in the software project being analysed.