Tillbaka till svenska Fidonet
English   Information   Debug  
OS2HW   0/42
OS2INET   0/37
OS2LAN   0/134
OS2PROG   0/36
OS2REXX   0/113
OS2USER-L   207
OS2   4785/4804
OSDEBATE   0/18996
PASCAL   0/490
PERL   0/457
PHP   0/45
POINTS   0/405
POLITICS   0/29554
POL_INC   0/14731
PSION   103
R20_ADMIN   1124
R20_AMATORRADIO   0/2
R20_BEST_OF_FIDONET   13
R20_CHAT   0/893
R20_DEPP   0/3
R20_DEV   399
R20_ECHO2   1387
R20_ECHOPRES   0/35
R20_ESTAT   0/719
R20_FIDONETPROG...
...RAM.MYPOINT
  0/2
R20_FIDONETPROGRAM   0/22
R20_FIDONET   0/248
R20_FILEFIND   0/24
R20_FILEFOUND   0/22
R20_HIFI   0/3
R20_INFO2   3281
R20_INTERNET   0/12940
R20_INTRESSE   0/60
R20_INTR_KOM   0/99
R20_KANDIDAT.CHAT   42
R20_KANDIDAT   28
R20_KOM_DEV   112
R20_KONTROLL   0/13332
R20_KORSET   0/18
R20_LOKALTRAFIK   0/24
R20_MODERATOR   0/1852
R20_NC   76
R20_NET200   245
R20_NETWORK.OTH...
...ERNETS
  0/13
R20_OPERATIVSYS...
...TEM.LINUX
  0/44
R20_PROGRAMVAROR   0/1
R20_REC2NEC   534
R20_SFOSM   0/341
R20_SF   0/108
R20_SPRAK.ENGLISH   0/1
R20_SQUISH   107
R20_TEST   2
R20_WORST_OF_FIDONET   12
RAR   0/9
RA_MULTI   106
RA_UTIL   0/162
REGCON.EUR   0/2056
REGCON   0/13
SCIENCE   0/1206
SF   0/239
SHAREWARE_SUPPORT   0/5146
SHAREWRE   0/14
SIMPSONS   0/169
STATS_OLD1   0/2539.065
STATS_OLD2   0/2530
STATS_OLD3   0/2395.095
STATS_OLD4   0/1692.25
SURVIVOR   0/495
SYSOPS_CORNER   0/3
SYSOP   0/84
TAGLINES   0/112
TEAMOS2   0/4530
TECH   0/2617
TEST.444   0/105
TRAPDOOR   0/19
TREK   0/755
TUB   0/290
UFO   0/40
UNIX   0/1316
USA_EURLINK   0/102
USR_MODEMS   0/1
VATICAN   0/2740
VIETNAM_VETS   0/14
VIRUS   0/378
VIRUS_INFO   0/201
VISUAL_BASIC   0/473
WHITEHOUSE   0/5187
WIN2000   0/101
WIN32   0/30
WIN95   0/4290
WIN95_OLD1   0/70272
WINDOWS   0/1517
WWB_SYSOP   0/419
WWB_TECH   0/810
ZCC-PUBLIC   0/1
ZEC   4

 
4DOS   0/134
ABORTION   0/7
ALASKA_CHAT   0/506
ALLFIX_FILE   0/1313
ALLFIX_FILE_OLD1   0/7997
ALT_DOS   0/152
AMATEUR_RADIO   0/1039
AMIGASALE   0/14
AMIGA   0/331
AMIGA_INT   0/1
AMIGA_PROG   0/20
AMIGA_SYSOP   0/26
ANIME   0/15
ARGUS   0/924
ASCII_ART   0/340
ASIAN_LINK   0/651
ASTRONOMY   0/417
AUDIO   0/92
AUTOMOBILE_RACING   0/105
BABYLON5   0/17862
BAG   135
BATPOWER   0/361
BBBS.ENGLISH   0/382
BBSLAW   0/109
BBS_ADS   0/5290
BBS_INTERNET   0/507
BIBLE   0/3563
BINKD   0/1119
BINKLEY   0/215
BLUEWAVE   0/2173
CABLE_MODEMS   0/25
CBM   0/46
CDRECORD   0/66
CDROM   0/20
CLASSIC_COMPUTER   0/378
COMICS   0/15
CONSPRCY   0/899
COOKING   34033
COOKING_OLD1   0/24719
COOKING_OLD2   0/40862
COOKING_OLD3   0/37489
COOKING_OLD4   0/35496
COOKING_OLD5   9370
C_ECHO   0/189
C_PLUSPLUS   0/31
DIRTY_DOZEN   0/201
DOORGAMES   0/2069
DOS_INTERNET   0/196
duplikat   6002
ECHOLIST   0/18295
EC_SUPPORT   0/318
ELECTRONICS   0/359
ELEKTRONIK.GER   1534
ENET.LINGUISTIC   0/13
ENET.POLITICS   0/4
ENET.SOFT   0/11701
ENET.SYSOP   33966
ENET.TALKS   0/32
ENGLISH_TUTOR   0/2000
EVOLUTION   0/1335
FDECHO   0/217
FDN_ANNOUNCE   0/7068
FIDONEWS   24205
FIDONEWS_OLD1   0/49742
FIDONEWS_OLD2   0/35949
FIDONEWS_OLD3   0/30874
FIDONEWS_OLD4   0/37224
FIDO_SYSOP   12853
FIDO_UTIL   0/180
FILEFIND   0/209
FILEGATE   0/212
FILM   0/18
FNEWS_PUBLISH   4475
FN_SYSOP   41736
FN_SYSOP_OLD1   71952
FTP_FIDO   0/2
FTSC_PUBLIC   0/13628
FUNNY   0/4886
GENEALOGY.EUR   0/71
GET_INFO   105
GOLDED   0/408
HAM   0/16095
HOLYSMOKE   0/6791
HOT_SITES   0/1
HTMLEDIT   0/71
HUB203   466
HUB_100   264
HUB_400   39
HUMOR   0/29
IC   0/2851
INTERNET   0/424
INTERUSER   0/3
IP_CONNECT   719
JAMNNTPD   0/233
JAMTLAND   0/47
KATTY_KORNER   0/41
LAN   0/16
LINUX-USER   0/19
LINUXHELP   0/1155
LINUX   0/22120
LINUX_BBS   0/957
mail   18.68
mail_fore_ok   249
MENSA   0/341
MODERATOR   0/102
MONTE   0/992
MOSCOW_OKLAHOMA   0/1245
MUFFIN   0/783
MUSIC   0/321
N203_STAT   934
N203_SYSCHAT   313
NET203   321
NET204   69
NET_DEV   0/10
NORD.ADMIN   0/101
NORD.CHAT   0/2572
NORD.FIDONET   189
NORD.HARDWARE   0/28
NORD.KULTUR   0/114
NORD.PROG   0/32
NORD.SOFTWARE   0/88
NORD.TEKNIK   0/58
NORD   0/453
OCCULT_CHAT   0/93
OS2BBS   0/787
OS2DOSBBS   0/580
Möte PERL, 457 texter
 lista första sista föregående nästa
Text 190, 90 rader
Skriven 2005-02-20 16:42:14 av mark lewis (1:3634/12)
    Kommentar till text 180 av Maurice Kinal (1:153/401.1)
Ärende: talking to myself
=========================
 RT>> with his "gate" a while back, that I know of that is.

 MK> That was one situation I as thinking of.  Got under everyone's
 MK> radar despite all the sysops in that echo noticing it.  So
 MK> much for MSGID eh?  Everyone of those messages was a true dupe
 MK> that not one tosser caught.

MSGID isn't the magic bullet that some seem to want to think it is... some of
your comments appear to be saying that it should/could be and that it isn't and
thus should be thrown in the bitbucket...

while i do /tend/ to agree, i also tend more to not agree... detecting dupes in
fidonet is not magic nor is it tied to one thing... for various reasons,
fidonet can't even use md5 checksums on the message body to determine if a
message is a duplicate... message headers, existing control lines, origin and
seenby and path lines can all be stripped or otherwise modified or corrupted...
the only real way to tell a dupe would be by enforcing some sort of message
body formatting and md5'ing the message body much the same way that PGP can be
used to sign a message to show if it has been modified since sending...

  ie: when the body is generated and the message saved,
      md5 the body and store that in a control line that
      travels with the message. i still don't recall if
      there are message processors out there that alter
      the message body (ie: by replacing CRLF with LF)

even then, you then have the problem of crossposted messages... is it a dupe
because it is exactly the same message in more than one area? i don't think
so...

detecting duplicates in fidonet is a tricky science, to say the least...
checking the header info and message control lines (including the origin line)
is about the only way... still this can fail due to the way some systems have
been retrofitted for fidonet messaging... wildcat, pcboard and wwiv systems are
the first three that come to mind as having shoehorned retrofits for
participation in fidonet... quite simply, their message bases were not designed
with fidonet in mind... actually, not just fidonet but more without any sort of
thought to control lines within messages...

it is long past the time when this stuff can truely be fixed and enforced...
all we can do now is to play the game and hope for the best... that said, there
are things that can be done to try to ensure that messages generated by your
software do make it past the various and sundry dupe checking schemes out
there... one of the first and easiest is to implement MSGID and ensure that it
is the first control line after the message header... this may or may not help
with very braindead dupe checking that looks to the header only with no regard
for the message body at all as that system was developed with a myopic view of
users creating messages and not with the thought that an automated process like
text file posting or offline mail doors may post more than one message per
second... most of the software that did that braindead method of dupe checking
have been tossed or upgraded for something that does the same but also takes
into consideration the first 20+ bytes of the message body...

there is still the problem of dupe checkers that use a CRC16 or CRC32 method of
storing an "ID" of a message based on the header and 20+ bytes of the
message... this is due to the simple fact that there are a limited number of
CRC16 and CRC32 results and that it is fairly trivial to find more than one
dataset that generates the same CRC16 or CRC32 value...

that takes us to the question of how to build a dataset of messages and what to
use as the duplicate trigger... remembering that many things are done in binary
in fidonet because of limited storage space as well as for speed of processing,
we have to ask what method would ultimately be the best for quick processing,
small storage, and generating truely unique IDs for the local duplicate
detection system?

the first thing i can think of is to record the header info and the entire
MSGID... the question is, then, how to record the header info? would one use
the actual fields or would one run the header fields thru a formula like md5 or
something else??

i can see possibly a two fold method involving recording the actual header data
as well as running it thru md5 or some such and recording the MSGID if it
exists...

that would likely be the utmost method but it wouldn't be the smallest data
record per message... there's also the question of speed... how much time are
you willing to spend rummaging thru a duplicate dataset looking for a match
before deciding if a message is a duplicate or not? considering your high
desire for speed, i can see small datasets (one per message area al la squish?)
to ease the search time...

interesting problem, this is... i'm already visualising multiple dupe dataset
files based on the AREA line, locally carried areas notwithstanding due to the
processing of passthru areas, or one large or even multiple large datafiles
containing AREA grouped datasets of header and MSGID data...

)\/(ark

 * Origin: (1:3634/12)