How often does ohloh update a repository?

Avatar

Daniel / Nazca

over 2 years ago

*nod* I did notice that ... just thought it was a little odd that it had been sat there so long doing nothing, but c'est la vie Thankies for the bump, s'all done now :)


Avatar

Peter D'Hoye

over 2 years ago

so... how's the new bandwidth doing? Just wondering because rockbox hasn't been updated since august 6 :/


Avatar

rah003

over 2 years ago

I guess there must be still some backlog. swingx wasn't updates since august 6th either.


Avatar

Daniel Stenberg

over 2 years ago

and curl hasn't moved since July 31st...


Avatar

rah003

over 2 years ago

Possibly unrelated to this: I've tried to browse through projects just to gasp how far back does the backlog possibly goes and found at this page two projects (randomly selected) which are both listed as updated on July 11th on a search page but inside it says last update January 2007 (for Squirrelmail) and March 2007 (for Ruby/ASP). So here are two questions related to this: Is the 'update date' on project browse something else then last update date inside of the project (seems that way) and more importantly is there some problem with the update selection mechanism which can potentially lease some projects not updated for as long as half a year?


Avatar

Robin Luckey

over 2 years ago

From the possibly-more-than-you-ever-wanted-to-know dept:

Projects and analyses are separate. A project can have many analyses laying around in our database, although we only show you the "best" one.

The project timestamp records the time of the last change of any kind to a project, whether it be a new description, tag changes, or license info.

An analysis timestamp records the time of our last query to the source control system.

So when you sort the project list by "updated at", you're seeing projects sorted by any change at all, not necessarily the last time we updated the analysis.

Importantly, it's possible that we have queried the source control more recently than the project was updated. This can happen if there haven't been any changes to the code in a while. In this case, we can continue using the same old analysis for the project. We mark the analysis timestamp to show that we've checked the source control, and the project continues on completely unchanged.

(It's also possible that a project contains no code, or the source control URLs are broken, or we simply have a bug that prevents us from downloading the code. These projects also never change.)

So the answer to question one is yes. There are a lot of projects whose timestamps haven't been updated in a year, yet their analyses are still being timestamped every week.

Question two: well, there could always be a bug, but the scheduler doesn't actually pick projects, the scheduler picks repositories. For every repository, we know the last time we queried the source control. This is what dictates priority. And just so we don't all go crazy, if we update any one repository in a project, we update them all. So projects are scheduled for update according to the time we last checked the source control for any of its repositories.

Last Friday we had a backlog of about 70 new projects, and more new ones are coming in all the time. This morning, we're down to 5 new projects in the queue, and soon we should be completely caught up -- and will hopefully stay that way [until the next boost in Ohloh traffic.] I really appreciate everyone's patience. I'm optimistic that our recent changes are going to make a big difference.


Avatar

rah003

over 2 years ago

Robin & Jason & others, Any update on how is the catching up with backlog going? Three days ago it was:

This morning, we're down to 5 new projects in the queue, and soon we should be completely caught up ...

so I wonder why there are still plenty of projects not updated since end of July beginning of August? I generally don't mind waiting, but if you could push up swingx bit up in the queue it would be great. Thx.


Avatar

Robin Luckey

over 2 years ago

Well, famous last words I guess. Some kind souls have added three separate linux distributions to our system recently, which kept us rather busy this week. We have broken through the log jam, though, and we are currently running updates on existing projects.

Recent changes to our system really will speed things up in the long run -- among other things, it allows us to install a few more servers over the next few days.

Meanwhile, I'll schedule swingx. It should run today.

Thanks, Robin


Avatar

Peter D'Hoye

over 2 years ago

so.... any updates?


Avatar

Andy Verprauskus

over 2 years ago

Hi Peter,

Our jobs are running pretty quickly but I wasn't sure how much longer rockbox might take. So I went ahead and scheduled it. The update finished and should be available now.

Thanks!


Avatar

Peter D'Hoye

over 2 years ago

Thanks! There was no need to, just being curious ;)


Avatar

Ciprian Murariu

over 2 years ago

Not sure if this is the right place to ask, but I try :D

Before I was aware of these topics, I looked into my project facts&figures and realized that phpmychat branches might not be added to the statistics. So I rushed in adding my own branches to the enlistments list.

Even if I've read the forum afterwards, I'm still wondering: Have I wrongly done that? I don't want to result in redundancies or overloading the Ohloh update/overall process. Am I suppose to add a branch or is ohloh going to count the branches of a trunk as well, without having me speciffying them? I appologize, but I'm so confused now.

And guys, excellent job! Keep it very up!!!


Avatar

Ciprian Murariu

over 2 years ago

Ok, I'm back. The branches I added seem to have been processed last night. It increased the number of commits, so I guess it has worked. I am just wondering why my profile rank haven't increased, as all those commits were under my username. Isn't the rank related to activity too? Just the received kudo's?

PS: Oh, and one thing that seems to be a bug: none of the titles I have given to the subversion links have been added. The module name is still blank, even if I did add a name for each branch. Isn't this working for subversions? (I see it worked fine for CVSs). Thank you.


Avatar

Robin Luckey

over 2 years ago

Hi Ciprian,

If you add all of your branches in one project, your report will show too many lines of code. Ohloh will count the lines of code in each branch separately and then add them all up. With 4 or 5 different branches added to Ohloh, the lines of code calculations will be 4 or 5 times too high.

However, if you want all of your commits to show, you don't have much choice. It's a design limitation in Ohloh. We're not smart enough (yet) to recognize that branches are mostly the same code repeated. So unfortunately, when you use Ohloh you have to choose between (a) showing all of your commits across all branches (but having line counts that are too high), or (b) showing a precise line count but missing all of the activity that happens on branches.

KudoRank is only updated once per day, so changes to projects are not immediately reflected in your KudoRank.

Also, it's not enough to write a lot of code. Someone has to use that code. When people add a project to their stack, you will receive Kudo credit if you have made commits to that project. So adding more commits to a project doesn't necessarily help you unless many people have stacked that project.

Don't worry about the Subversion module names. Subversion does not create branches in the same way that other source control systems do. Subversion creates branches by simply copying files to new directories. The module name is always blank for Subversion. As long as the directory name shown in the URL is correct, Ohloh will find the correct code.


Avatar

Ciprian Murariu

over 2 years ago

Thank you, much clear now. Unfortunately, I have to appologize for the delay, as I can't set to receive notifications yet... (I reported that in a different topic here: http://www.ohloh.net/forums/11/topics/855 )


Avatar

Guillermo Roditi

over 2 years ago

Maybe you guys could have an info page which lists the status of the queue for updating projects? To avoid the question of "how long till my project is updated?" I don't know if your queue works like that, but it's be cool for us users and avoid you the annoyance of answering that same question over and over.

Side not of branching: I use SVK and when i svk push back to trunk all my commits happen again, so they should all show up in my project. My project (Reaction) hasn't been updated yet, so I don't know if it'll work like that, but i think it will. So if you are using svn and it really means that much to you, you could use svk. If you are making heavy use of branches then you should probably be using svk anyways, it supports everything svn does and will allow you to have this feature.


Avatar

Robin Luckey

over 2 years ago

We have a bit of a status page here.

Right now, most project reports are about 2 weeks old. We seem to be in an endless arms race between the number of download servers we are maintaining vs. the number of new projects coming into the system. It's great that the Ohloh service has become very popular, but it has been an operational challenge.


Avatar

nigel_ht

over 2 years ago

Have you looked at AWS EC2 for surge capability?


Avatar

Robin Luckey

over 2 years ago

Yes, we've talked about EC2 a bit.

We haven't looked at it very deeply yet for a couple of reasons. Although our application is well distributed across a lot of download servers, those servers are very tightly coupled with our central database (a very big computer), and these download servers require a lot of storage, bandwidth, and CPU. This would seem to make us a poor fit for EC2 right now. We don't have a lot of development time to make the architectural changes it would require.

However, in the long run, we are definitely interested in decoupling our download servers from the database, and creating a much more scalable system. At that point, EC2 might be a reasonable option. We just have so many things on our plate already that it will be a while before we get there.

Robin


Avatar

Ivan Vučica

over 2 years ago

Regarding the branching, LOC and commit problem. Why not add a possibility to add "secondary" repositories, which would not be counted in LOC, but would show up in commits?


Avatar

teravus

about 1 year ago

I could be wrong, however it looks like C# projects are not updating.... Could be the analysis that isn't updating.


Avatar

Thomas Despoix

about 1 year ago

Mediboard project has not been updated for the past 6 weeks, Is there a problem with this one? or is it a typical -- yet extralong -- overload of your servers?


Avatar

Robin Luckey

about 1 year ago

Hi mytto - That is unusual. This project name is familiar to me, as if we had some problems in the past, but from looking through our logs I can't see any reason why this project isn't updating. I've rescheduled the updates, and I'll keep an eye on it today. I'll let you know if something odd comes up.

Thanks, Robin


Avatar

Thomas "zeroflag&...

about 1 year ago

http://www.ohloh.net/projects/11089/enlistments Ohloh update completed 22 days ago.

:/

could you please investigate?

regards.


Avatar

Robin Luckey

about 1 year ago

Hi zeroflag,

Sorry, there's been a lot going on around here, and we are again falling behind on updates. I've scheduled an update for this project.

Robin


Padlock