Thierry Schellenbach from YouTellMe.nl recently wrote on his blog that their site is probably the largest Django project in terms of codebase. Since their project isn’t open-source, it’s hard to tell if that claim is true or not.
But it got me wondering. What is the largest open-source Django-based project around?
Some times ago I already posted some ’source lines of code’ (SLoCs) metrics about the Peach³ Django project (41209 at the time).
But is SLoCs the best way to define which project is the largest? Or maybe we should use some more ‘Django-based’ metrics, like number of unique apps (22 for Peach³) or number of models (86)?
Or maybe the largest project is the project with the most people using it on a day-to-day basis (approximately 350 for Peach³ at the moment).
And, using any of these metrics, what would be the largest Django-based open-source project around at the moment? I could not find any metrics of any other projects.
I feel Peach³ might actually be one of the bigger projects out there, but I have nothing to base that feeling on.
Tags: Django, Open source, SLoC
Yeay there’s no good listing of size available.
We are rather large for a Dutch project.
Don’t know internationally actually
Currently we are around 87926 lines of code.
With >200 models
Unique apps 31 (not counting usage of external apps)
Small projects can have a lot of models too. Back in 2008 and early 2009 I was accessing the Chado genome database scheme through the Django ORM. Despite the number of models (> 300 at that time) is was easy to work with due to the structured setup of models. The project had only 6 apps and around 4000 lines of python code at the time I stopped working on the project. The project also had about 1000 lines of custom Javascript code.
I can imagine longer running projects may have a substantial amount of javascript, flash or other non-python code. These should be counted as well since they are part of the working project.
What about documentation? In my view, lots of code with virtually no documentation is rated lower than a small, clear and well documented project.
Satchmo has to be up there. According to ohloh – http://www.ohloh.net/p/satchmo
Satchmo has 144,000+ lines of code+ markup or 71,000+ for code only.
Didn’t know ohloh yet. Interesting site to compare projects. Just added peach³ to it (http://www.ohloh.net/p/peach3), and it’s downloading the repository right now.
Let’s see what the statistics are according to them.
We are launching our product at my company right now (http://www.redecolibri.com/) and, while it’s not as big as Lawrence and other sites that use Django, it’s fairly complex.
It’s a system for managing real estate information, coupled with customer sites in a SaaS fashion (each realtor gets their customizable site inside a subdomain). There’s also a business network where realtors can partner with each other and share information. Something a bit different, as the target market is Brazil, and the real estate market is diverse here.
We are making using of a bunch of different Python and Django related tools. Full text search ORM + Tsearch2, GeoDjango + PostGIS, South, Pinax modules, ImageKit, Python image processing libraries, custom middlewares for SaaS, (many) caching frameworks for memcached and others, Amazon AWS API libraries, django_extensions jobs for billing and cleanup, Fabric deployment scripts, etc. All being developed and improved in record time.
It’s really awesome how easy is to integrate stuff with Django, the framework barely stays in your way (as opposed to my past experiences with other frameworks, namely Rails
). For example, integrating GeoDjango with TSearch2, and being able to search properties using capabilities from both (coordinates and full text indexing) in the same QuerySet was a breeze. Definetly a success case for Django.