--- Log opened Fri Apr 13 00:00:53 2012 14:24 < conseo> db will be ok, if we only request the newest parsed bites. in average this means almost all db request go to the same cached current part of the table 14:25 < conseo> keys are: parsed_date, pollname, maillish-username 14:26 < conseo> mcallan: does that sound reasonable? it will still make sense to create a seperate table for the usernames and pollnames, because many of them will be duplicates 14:28 < conseo> fields needed for the feed: pollname, author-mailishusername, sender-mailishusername, summary, sentDate, url 14:28 < conseo> parsedDate for the effective caching 18:59 < mcallan> http://whiletaker.homeip.net/votorola/harvester/javadoc/votorola/s/wap/HarvestWAP.html 19:00 < mcallan> (1) hPoll is mandatory, but feed can show bites without filtering by poll. how can it request that? 19:05 < mcallan> (2) the purpose of "parsed date" is unclear. don't clients only care about time of posting? 19:10 < mcallan> (3) the names are not consistent, which is confusing. between wap and table there are hUser, username, and maillish-username (spelling mistake there) - and all designate the same information. should all be same name. likewise for hPoll, pollname, and the dates 19:15 < mcallan> (4) "db will be ok, if we only request the newest parsed bites". almost always client will request new messages, but why is requesting old ones a problem? 19:16 < mcallan> (5) HarvestWAP is a "web API for the harvested messages", but that is unclear. javadoc maybe needs a link to the harvest package 19:20 < mcallan> (6) "seperate table for the usernames and pollnames, because many of them will be duplicates". i don't understand. what problem is caused by single table for harvest cache? 19:32 < mcallan> (ad 3) actually it's ok to have nice short "hUser" and elsewhere more explicit "username" (likewise "hPoll" and "pollname"), but don't include "mailish" in formal name. all votorola usernames are mailish 19:36 < mcallan> (7) it's maybe better to store email addresses not usernames in table. usernames are not the canonical identifiers. http://zelea.com/project/votorola/_/javadoc/votorola/a/voter/IDPair.html#email() 19:38 < mcallan> column name might be "sender_email" or simply "sender" 19:40 < mcallan> (sorry, sender has different meaning in email. better would be from_email or author/author_email) 19:40 < conseo> mcallan: Oracle says: "While copyrighting a computer language cannot prevent others from designing programming languages that serve the same functions, the detailed vocabulary and written expression of the computer language should be protectable elements if sufficiently original and creative." 19:41 < conseo> http://www.theverge.com/2012/4/13/2944440/google-oracle-lawsuit-programming-language-copyright 19:41 < conseo> just read that, will reply later, have visitors :-) 19:44 < mcallan> ok (interesting, google is right that a language cannot be copyright. it would have to be patented, or the name trademarked, but there is no way to copyright the *means* of expression) --- Log closed Sat Apr 14 00:00:09 2012