This site is an archive; learn more about 8 years of OpenHatch.

[OH-Dev] Two different ways to refresh data on old bugs

Asheesh Laroia lists at asheesh.org
Tue Nov 6 02:08:37 UTC 2012


Excerpts from Asheesh Laroia's message of Thu Nov 01 14:37:32 -0400 2012:
> Hello, all OH-Devvers,
> 
> I've been working on oh-bugimporters, and I'm working through the changes 
> necessary to support refreshing data about older bugs: 
> http://openhatch.org/bugs/issue772
> 
> I've hit an architectural issue I wanted some feedback on.
> 
> As background: What we do for Bugzilla, Trac, and Roundup is to re-scrape 
> each bug that we've ever seen, once a day. (We have to do that if, say, 
> the query we're given only shows new bugs, and a bug gets marked as 
> resolved.) For Trac and Roundup, it's O(N) requests to the remote bug 
> tracker. For Bugzilla we can group those into one big query that gets data 
> about all the bugs.
> 
> I've been looking at the Github and Google Code Issues API, and it seems 
> that both of them prefer that you make requests like this by asking for 
> "all bug data that has changed since a date".
> 
> (References:
> 
> Github: http://developer.github.com/v3/issues/
> 
> Google Code: https://code.google.com/p/support/wiki/IssueTrackerAPI )
> 
> Given that, I think it makes sense to make the following changes:
> 
> * In oh-mainline, for the backends that support it, change 
> mysite/customs/models.py GoogleTrackerModel.as_dict() and 
> GithubTrackerModel.as_dict() to also export a special bit of data called 
> get_older_bug_data. That will point to the URL of a query that, when 
> downloaded, will give all updates on that remote bug tracker since 
> whatever date you ask. Pre-configure that query URL to get all updates 
> since the minimum last_polled date of all the bugs we know about from the 
> tracker.

Implemented: https://github.com/openhatch/oh-mainline/pull/88

> * In oh-bugimporters: for Google Code and Github bug trackers, use the 
> get_older_bug_data query to download all updates since that date, and then 
> filter down the results so that we only export the data corresponding to 
> bugs we are actually tracking.

I'll get on this shortly, probably 0-2 days from now. It's a bit 
complicated. If someone wants to beat me to it, that'd be fine too!

-- Asheesh.


More information about the Devel mailing list