Tillbaka till svenska Fidonet
English   Information   Debug  
OS2BBS   0/787
OS2DOSBBS   0/580
OS2HW   0/42
OS2INET   0/37
OS2LAN   0/134
OS2PROG   0/36
OS2REXX   0/113
OS2USER-L   207
OS2   0/4806
OSDEBATE   4976/18996
PASCAL   0/490
PERL   452/457
PHP   0/45
POINTS   0/405
POLITICS   23536/29554
POL_INC   0/14731
PSION   103
R20_ADMIN   1132
R20_AMATORRADIO   0/2
R20_BEST_OF_FIDONET   15
R20_CHAT   0/894
R20_DEPP   0/3
R20_DEV   400
R20_ECHO2   1675
R20_ECHOPRES   0/35
R20_ESTAT   0/719
R20_FIDONETPROG...
...RAM.MYPOINT
  0/2
R20_FIDONETPROGRAM   0/22
R20_FIDONET   0/248
R20_FILEFIND   0/24
R20_FILEFOUND   0/22
R20_HIFI   0/3
R20_INFO2   3565
R20_INTERNET   0/12940
R20_INTRESSE   0/60
R20_INTR_KOM   0/99
R20_KANDIDAT.CHAT   42
R20_KANDIDAT   28
R20_KOM_DEV   112
R20_KONTROLL   0/13360
R20_KORSET   0/18
R20_LOKALTRAFIK   0/24
R20_MODERATOR   919/1852
R20_NC   76
R20_NET200   245
R20_NETWORK.OTH...
...ERNETS
  0/13
R20_OPERATIVSYS...
...TEM.LINUX
  0/44
R20_PROGRAMVAROR   0/1
R20_REC2NEC   534
R20_SFOSM   0/341
R20_SF   0/108
R20_SPRAK.ENGLISH   0/1
R20_SQUISH   107
R20_TEST   2
R20_WORST_OF_FIDONET   20
RAR   0/9
RA_MULTI   106
RA_UTIL   0/162
REGCON.EUR   0/2066
REGCON   0/13
SCIENCE   0/1206
SF   0/239
SHAREWARE_SUPPORT   0/5146
SHAREWRE   0/14
SIMPSONS   0/169
STATS_OLD1   1571/2539.065
STATS_OLD2   2328/2530
STATS_OLD3   2256/2395.095
STATS_OLD4   0/1692.25
SURVIVOR   0/495
SYSOPS_CORNER   0/3
SYSOP   0/84
TAGLINES   0/112
TEAMOS2   3999/4530
TECH   1651/2617
TEST.444   0/105
TRAPDOOR   0/19
TREK   0/755
TUB   0/290
UFO   0/40
UNIX   0/1316
USA_EURLINK   0/102
USR_MODEMS   0/1
VATICAN   0/2740
VIETNAM_VETS   0/14
VIRUS   0/378
VIRUS_INFO   0/201
VISUAL_BASIC   0/473
WHITEHOUSE   0/5187
WIN2000   0/101
WIN32   0/30
WIN95   0/4291
WIN95_OLD1   22897/70272
WINDOWS   0/1517
WWB_SYSOP   0/419
WWB_TECH   0/810
ZCC-PUBLIC   0/1
ZEC   4

 
4DOS   0/134
ABORTION   0/7
ALASKA_CHAT   92/506
ALLFIX_FILE   901/1313
ALLFIX_FILE_OLD1   0/7997
ALT_DOS   0/152
AMATEUR_RADIO   978/1039
AMIGASALE   0/14
AMIGA   0/331
AMIGA_INT   0/1
AMIGA_PROG   0/20
AMIGA_SYSOP   0/26
ANIME   0/15
ARGUS   0/924
ASCII_ART   0/340
ASIAN_LINK   0/651
ASTRONOMY   0/417
AUDIO   0/92
AUTOMOBILE_RACING   0/105
BABYLON5   12101/17862
BAG   135
BATPOWER   0/361
BBBS.ENGLISH   0/382
BBSLAW   0/109
BBS_ADS   3698/5290
BBS_INTERNET   0/507
BIBLE   0/3563
BINKD   600/1119
BINKLEY   0/215
BLUEWAVE   2144/2173
CABLE_MODEMS   0/25
CBM   0/46
CDRECORD   0/66
CDROM   0/20
CLASSIC_COMPUTER   0/378
COMICS   0/15
CONSPRCY   0/899
COOKING   40011
COOKING_OLD1   20687/24719
COOKING_OLD2   9955/40862
COOKING_OLD3   29272/37489
COOKING_OLD4   34444/35496
COOKING_OLD5   9370
C_ECHO   0/189
C_PLUSPLUS   0/31
DIRTY_DOZEN   0/201
DOORGAMES   1404/2155
DOS_INTERNET   0/196
duplikat   6102
ECHOLIST   0/18295
EC_SUPPORT   0/318
ELECTRONICS   0/359
ELEKTRONIK.GER   1534
ENET.LINGUISTIC   0/13
ENET.POLITICS   0/4
ENET.SOFT   0/11701
ENET.SYSOP   34212
ENET.TALKS   0/32
ENGLISH_TUTOR   1466/2000
EVOLUTION   0/1335
FDECHO   0/217
FDN_ANNOUNCE   0/7068
FIDONEWS   24756
FIDONEWS_OLD1   18659/49742
FIDONEWS_OLD2   8655/35949
FIDONEWS_OLD3   7745/30874
FIDONEWS_OLD4   2496/37224
FIDO_SYSOP   12913
FIDO_UTIL   0/180
FILEFIND   175/209
FILEGATE   0/212
FILM   0/18
FNEWS_PUBLISH   4758
FN_SYSOP   42065
FN_SYSOP_OLD1   71952
FTP_FIDO   0/2
FTSC_PUBLIC   12457/13899
FUNNY   0/4886
GENEALOGY.EUR   0/71
GET_INFO   105
GOLDED   0/408
HAM   0/16425
HOLYSMOKE   0/6791
HOT_SITES   0/1
HTMLEDIT   0/71
HUB203   466
HUB_100   264
HUB_400   39
HUMOR   0/29
IC   0/2851
INTERNET   0/424
INTERUSER   0/3
IP_CONNECT   719
JAMNNTPD   0/233
JAMTLAND   0/47
KATTY_KORNER   0/41
LAN   0/16
LINUX-USER   0/19
LINUXHELP   0/1155
LINUX   4431/22268
LINUX_BBS   0/957
mail   18.68
mail_fore_ok   249
MENSA   0/341
MODERATOR   0/102
MONTE   0/992
MOSCOW_OKLAHOMA   0/1245
MUFFIN   0/783
MUSIC   0/321
N203_STAT   938
N203_SYSCHAT   313
NET203   321
NET204   69
NET_DEV   0/10
NORD.ADMIN   0/101
NORD.CHAT   0/2572
NORD.FIDONET   189
NORD.HARDWARE   0/28
NORD.KULTUR   0/114
NORD.PROG   0/32
NORD.SOFTWARE   0/88
NORD.TEKNIK   0/58
NORD   0/453
OCCULT_CHAT   0/93
Möte OSDEBATE, 18996 texter
 lista första sista föregående nästa
Text 2149, 61 rader
Skriven 2005-01-25 10:33:48 av Chris (1:379/45)
   Kommentar till text 2148 av Chris (1:379/45)
Ärende: Re: An open source Google - without the ads
===================================================
From: Chris <nospam@noemail>

http://www.scroogle.org/gscrape.html

Scraping and ad-stripping Google's results

If done in the public interest and not for profit, it's legal. What's more,
Google can't block you if they can't find you.


Public Information Research, Inc., the nonprofit public charity behind
www.google-watch.org and www.scroogle.org, has been running a Google proxy for
more than two years. On January 3, 2005 we released the source code for our
proxy. Our review of the legal situation has convinced us that we are covered
by "fair use" under the Copyright Act.

This step that we have taken has implications for all search engines. These
engines crawl the public web without asking permission, and cache and reproduce
the content without asking permission, and then use this information as a
carrier for ads that generate private profit. We are convinced that if citizens
scrape Google and strip the ads, and make the scraped results available as a
nonprofit public service, that this is legal. This is especially the case if
there are public policy concerns behind the scraping.

Google Watch has been the most prominent critic of Google's outrageous privacy
policies for more than two years. This is why we started the proxy, and it's
why we continue the proxy. We invite Google to serve us with a cease and desist
letter as a first step toward resolving this issue. So far, we have yet to hear
from Google's lawyers. By releasing the source code for our proxy, we're trying
to escalate the issue.

If it can be established that what we're doing is legal -- or at least
sufficiently legal so that Google is not eager to challenge us -- then this
will begin to restore a public-interest balance to the web that has been
declining ever since big money got behind the dot-coms.

There is the additional problem of whether anyone who scrapes Google can avoid
getting blocked by Google. We experienced this when Google blocked Scroogle in
December, 2003. We moved to a different server and continued as before, because
Google could no longer find us. In our opinion, it's legal for Google to block
whomever they want, even while it's also legal for us to scrape them if we can.

If the scraping is done properly, it is not worth Google's trouble to find you.
Our source code separates the "fetch" portion of program, which is done by curl
or wget, from the searcher interface and parsing of the fetched results. If the
fetching is done by a server on a different Class C address from the website
that shows the scraped results, there is little that Google can do to find the
IP address that is responsible for the actual fetch.

Chris wrote:
> http://www.theregister.co.uk/2005/01/11/open_source_google_scraper/
> (context links below story)
>
> An open source Google - without the ads
> By Andrew Orlowski in San Francisco
> Published Tuesday 11th January 2005 09:44 GMT
>
<snip>

--- BBBS/NT v4.01 Flag-5
 * Origin: Barktopia BBS Site http://HarborWebs.com:8081 (1:379/45)