Tillbaka till svenska Fidonet
English   Information   Debug  
OS2HW   0/42
OS2INET   0/37
OS2LAN   0/134
OS2PROG   0/36
OS2REXX   0/113
OS2USER-L   207
OS2   0/4786
OSDEBATE   0/18996
PASCAL   0/490
PERL   0/457
PHP   0/45
POINTS   0/405
POLITICS   0/29554
POL_INC   0/14731
PSION   103
R20_ADMIN   1123
R20_AMATORRADIO   0/2
R20_BEST_OF_FIDONET   13
R20_CHAT   0/893
R20_DEPP   0/3
R20_DEV   399
R20_ECHO2   1379
R20_ECHOPRES   0/35
R20_ESTAT   0/719
R20_FIDONETPROG...
...RAM.MYPOINT
  0/2
R20_FIDONETPROGRAM   0/22
R20_FIDONET   0/248
R20_FILEFIND   0/24
R20_FILEFOUND   0/22
R20_HIFI   0/3
R20_INFO2   3249
R20_INTERNET   0/12940
R20_INTRESSE   0/60
R20_INTR_KOM   0/99
R20_KANDIDAT.CHAT   42
R20_KANDIDAT   28
R20_KOM_DEV   112
R20_KONTROLL   0/13300
R20_KORSET   0/18
R20_LOKALTRAFIK   0/24
R20_MODERATOR   0/1852
R20_NC   76
R20_NET200   245
R20_NETWORK.OTH...
...ERNETS
  0/13
R20_OPERATIVSYS...
...TEM.LINUX
  0/44
R20_PROGRAMVAROR   0/1
R20_REC2NEC   534
R20_SFOSM   0/341
R20_SF   0/108
R20_SPRAK.ENGLISH   0/1
R20_SQUISH   107
R20_TEST   2
R20_WORST_OF_FIDONET   12
RAR   0/9
RA_MULTI   106
RA_UTIL   0/162
REGCON.EUR   0/2056
REGCON   0/13
SCIENCE   0/1206
SF   0/239
SHAREWARE_SUPPORT   0/5146
SHAREWRE   0/14
SIMPSONS   0/169
STATS_OLD1   0/2539.065
STATS_OLD2   0/2530
STATS_OLD3   0/2395.095
STATS_OLD4   0/1692.25
SURVIVOR   0/495
SYSOPS_CORNER   0/3
SYSOP   0/84
TAGLINES   0/112
TEAMOS2   0/4530
TECH   0/2617
TEST.444   0/105
TRAPDOOR   0/19
TREK   0/755
TUB   0/290
UFO   0/40
UNIX   0/1316
USA_EURLINK   0/102
USR_MODEMS   0/1
VATICAN   0/2740
VIETNAM_VETS   0/14
VIRUS   0/378
VIRUS_INFO   0/201
VISUAL_BASIC   0/473
WHITEHOUSE   0/5187
WIN2000   0/101
WIN32   0/30
WIN95   0/4289
WIN95_OLD1   0/70272
WINDOWS   0/1517
WWB_SYSOP   0/419
WWB_TECH   0/810
ZCC-PUBLIC   0/1
ZEC   4

 
4DOS   0/134
ABORTION   0/7
ALASKA_CHAT   0/506
ALLFIX_FILE   0/1313
ALLFIX_FILE_OLD1   0/7997
ALT_DOS   0/152
AMATEUR_RADIO   0/1039
AMIGASALE   0/14
AMIGA   0/331
AMIGA_INT   0/1
AMIGA_PROG   0/20
AMIGA_SYSOP   0/26
ANIME   0/15
ARGUS   0/924
ASCII_ART   0/340
ASIAN_LINK   0/651
ASTRONOMY   0/417
AUDIO   0/92
AUTOMOBILE_RACING   0/105
BABYLON5   0/17862
BAG   135
BATPOWER   0/361
BBBS.ENGLISH   0/382
BBSLAW   0/109
BBS_ADS   0/5290
BBS_INTERNET   0/507
BIBLE   0/3563
BINKD   0/1119
BINKLEY   0/215
BLUEWAVE   0/2173
CABLE_MODEMS   0/25
CBM   0/46
CDRECORD   0/66
CDROM   0/20
CLASSIC_COMPUTER   0/378
COMICS   0/15
CONSPRCY   0/899
COOKING   33421
COOKING_OLD1   0/24719
COOKING_OLD2   0/40862
COOKING_OLD3   0/37489
COOKING_OLD4   0/35496
COOKING_OLD5   9370
C_ECHO   0/189
C_PLUSPLUS   0/31
DIRTY_DOZEN   0/201
DOORGAMES   0/2065
DOS_INTERNET   0/196
duplikat   6002
ECHOLIST   0/18295
EC_SUPPORT   0/318
ELECTRONICS   0/359
ELEKTRONIK.GER   1534
ENET.LINGUISTIC   0/13
ENET.POLITICS   0/4
ENET.SOFT   0/11701
ENET.SYSOP   33945
ENET.TALKS   0/32
ENGLISH_TUTOR   0/2000
EVOLUTION   0/1335
FDECHO   0/217
FDN_ANNOUNCE   0/7068
FIDONEWS   24159
FIDONEWS_OLD1   0/49742
FIDONEWS_OLD2   0/35949
FIDONEWS_OLD3   0/30874
FIDONEWS_OLD4   0/37224
FIDO_SYSOP   12852
FIDO_UTIL   0/180
FILEFIND   0/209
FILEGATE   0/212
FILM   0/18
FNEWS_PUBLISH   4436
FN_SYSOP   41706
FN_SYSOP_OLD1   71952
FTP_FIDO   0/2
FTSC_PUBLIC   0/13613
FUNNY   0/4886
GENEALOGY.EUR   0/71
GET_INFO   105
GOLDED   0/408
HAM   0/16074
HOLYSMOKE   0/6791
HOT_SITES   0/1
HTMLEDIT   0/71
HUB203   466
HUB_100   264
HUB_400   39
HUMOR   0/29
IC   0/2851
INTERNET   0/424
INTERUSER   0/3
IP_CONNECT   719
JAMNNTPD   0/233
JAMTLAND   0/47
KATTY_KORNER   0/41
LAN   0/16
LINUX-USER   0/19
LINUXHELP   0/1155
LINUX   0/22112
LINUX_BBS   0/957
mail   18.68
mail_fore_ok   249
MENSA   0/341
MODERATOR   0/102
MONTE   0/992
MOSCOW_OKLAHOMA   0/1245
MUFFIN   0/783
MUSIC   0/321
N203_STAT   930
N203_SYSCHAT   313
NET203   321
NET204   69
NET_DEV   0/10
NORD.ADMIN   0/101
NORD.CHAT   0/2572
NORD.FIDONET   189
NORD.HARDWARE   0/28
NORD.KULTUR   0/114
NORD.PROG   0/32
NORD.SOFTWARE   0/88
NORD.TEKNIK   0/58
NORD   0/453
OCCULT_CHAT   0/93
OS2BBS   0/787
OS2DOSBBS   0/580
Möte PERL, 457 texter
 lista första sista föregående nästa
Text 191, 116 rader
Skriven 2005-02-20 15:05:50 av Maurice Kinal (1:153/401.1)
    Kommentar till text 190 av mark lewis (1:3634/12)
Ärende: talking to myself
=========================
Hey mark!

Feb 20 16:42 05, mark lewis wrote to Maurice Kinal:

 ml> MSGID isn't the magic bullet that some seem to want to think it is... 
 ml> some of your comments appear to be saying that it should/could be and 
 ml> that it isn't and thus should be thrown in the bitbucket...

Yes and no.  What I am trying to say, or what I think I am saying, is that
without rhyme or reason ALL that is good for won't make any difference whether
it is thrown into the bitbucket or not.  Without any meaningful logic it is a
complete waste of bytes and processing it causes.  With logic it has potential
as a viable accounting flag/tag/whatever.  I still have doubts about it's
dupechecking abilities but at least it has some potential.  Currently I have
doubts about any real usefulness to man or machine.

 ml> while i do /tend/ to agree, i also tend more to not agree... 

Sounds reasonable.

 ml> even then, you then have the problem of crossposted messages... is it 
 ml> a dupe because it is exactly the same message in more than one area? 
 ml> i don't think so...

Nor do I.  As long as it is accountable in the area it shows up in then I can't
see a problem with it, even if it shows up in other areas.  However, having
said that, I'd think there may be a better way <patent pending> to archive
crossposted messages where one carries more then one area where that message is
"posted" to.  A single message could fly more then one area tag.  Thus some
redundancy could effectively be eliminated.  No?

 ml> detecting duplicates in fidonet is a tricky science,

I would agree with that assessment.

 ml> messaging... wildcat, pcboard and wwiv systems are the first three 
 ml> that come to mind as having shoehorned retrofits for participation in 
 ml> fidonet... quite simply, their message bases were not designed with 
 ml> fidonet in mind... actually, not just fidonet but more without any 
 ml> sort of thought to control lines within messages...

Right.  Having a trimmed down archiving system where all stored messages only
contain what is absolutely needed to successfully be deemed a "message" - say
"To", "From", "Date" - and then tack on whatever else is required depending on
the target, would greatly reduce the amount of information any archived base or
area needs to know.  For instance a dynamic cgi script could take this
information and "convert" it to html display to the end user without affecting
the archive in any meaningful way, and that exact same archive could be
employed to construct outbound Fido compliant pkts.

 ml> it is long past the time when this stuff can truely be fixed and 
 ml> enforced...

Probably but that doesn't mean we can't discuss, and/or employ, any of this
"stuff" to our advantage.  Chances are by doing that we may all find ourselves
complying out of choice as opposed to enforcement ... or so the theory goes.

 ml> all we can do now is to play the game and hope for the 
 ml> best...

That is one way.

 ml> that takes us to the question of how to build a dataset of messages 
 ml> and what to use as the duplicate trigger...

Right.

 ml> things are done in binary in fidonet because of limited storage space 
 ml> as well as for speed of processing, we have to ask what method would 
 ml> ultimately be the best for quick processing, small storage, and 
 ml> generating truely unique IDs for the local duplicate detection 
 ml> system?

That is a toughy for sure.  Again I would think a standard method of generation
of MSGID would be of great assistance to all.  It isn't foolproof (is
anything?) but it would help.

 ml> i can see possibly a two fold method involving recording the actual 
 ml> header data as well as running it thru md5 or some such and recording 
 ml> the MSGID if it exists...

Possibly.  It sounds like it has potential.

 ml> speed... how much time are you willing to spend rummaging thru a 
 ml> duplicate dataset looking for a match before deciding if a message is 
 ml> a duplicate or not?

Heh, heh.  It depends on how big a problem dupes really are.  Not many REAL
dupes and then I would say zero "rummaging", but if I were Rusty and seeing
hundreds of REAL dupes then I'd really wish my uplink was doing better quality
control.  But then that of course brings up the question whether or not the
uplink isn't filtering out messages that aren't really dupes but instead MSGID
dupes.  I've seen those and have seriously wondered if the few I do manage to
see aren't representative of a far greater and unseen problem regarding the
whole MSGID situation as it stands today.

 ml> considering your high desire for speed, i can see 
 ml> small datasets (one per message area al la squish?) to ease the 
 ml> search time...

Possibly.  I have been pondering what I wish to do locally for myself all the
way around, not just Fido.

 ml> interesting problem, this is... i'm already visualising multiple dupe 
 ml> dataset files based on the AREA line, locally carried areas 
 ml> notwithstanding due to the processing of passthru areas, or one large 
 ml> or even multiple large datafiles containing AREA grouped datasets of 
 ml> header and MSGID data...

Interesting to ponder.

Life is good,
Maurice

--- Msged/LNX 6.1.2
 * Origin: Coffin Point - Ladysmith, BC Canada (1:153/401.1)