I just ran cloc . on the main dataverse repo as of 4e1bc5bccd and we have 184K lines of Java code!
2426 text files.
2292 unique files.
281 files ignored.
github.com/AlDanial/cloc v 1.98 T=3.60 s (637.5 files/s, 140827.1 lines/s)
--------------------------------------------------------------------------------
Language files blank comment code
--------------------------------------------------------------------------------
Java 1148 41597 37827 183553
Text 45 370 0 85733
XSD 29 1621 663 23665
XHTML 81 507 738 15879
SQL 127 2441 2603 12616
JSON 185 32 0 11402
reStructuredText 111 9063 5762 11281
JavaScript 39 2045 2385 9853
XML 64 505 1669 8138
Properties 32 176 208 5617
Markdown 57 2389 15 3845
Bourne Shell 156 671 660 2859
XSLT 2 224 24 2304
Python 41 610 670 2072
HTML 25 155 63 1710
CSS 16 193 161 1675
Maven 5 88 166 1622
SVG 29 2 12 1006
YAML 18 146 264 882
Perl 6 238 54 677
R 9 170 182 633
Bourne Again Shell 49 46 60 415
DOS Batch 1 29 1 212
make 3 43 6 208
Dockerfile 4 35 120 193
PlantUML 3 23 1 123
Ruby 1 5 8 42
CSV 4 0 0 39
INI 1 12 31 23
Jupyter Notebook 1 0 239 12
--------------------------------------------------------------------------------
SUM: 2292 63436 54592 388289
--------------------------------------------------------------------------------
Last updated: Nov 01 2025 at 14:11 UTC