Aller au contenu

« Utilisateur:DumZiBoT/Test » : différence entre les versions

Une page de Wikipédia, l'encyclopédie libre.
Contenu supprimé Contenu ajouté
NicDumZ (discuter | contributions)
DumZiBoT (discuter | contributions)
m Bot: Correction des refs. mal formatées (cf. explications)
Ligne 46 : Ligne 46 :
* <ref>http://www.jstor.org/cgi-bin/jstor/viewitem/00223816/di976533/97p0420z/0?frame=noframe&dpi=3&userID=ca4491e6@iitb.ac.in/01cce4405e00501c2d76b&backcontext=page</ref> JSTOR, excluded
* <ref>http://www.jstor.org/cgi-bin/jstor/viewitem/00223816/di976533/97p0420z/0?frame=noframe&dpi=3&userID=ca4491e6@iitb.ac.in/01cce4405e00501c2d76b&backcontext=page</ref> JSTOR, excluded
* <ref>http://www.medscape.com/viewarticle/554347?sssdmh=dm1.259053&src=ddd</ref> blacklisted ?
* <ref>http://www.medscape.com/viewarticle/554347?sssdmh=dm1.259053&src=ddd</ref> blacklisted ?
* <ref>http://www.sci.aha.ru/ATL/ra13a.htm</ref> encoding problems
* <ref>[http://www.sci.aha.ru/ATL/ra13a.htm Плотность Населения И Система Расселения<!-- Titre généré automatiquement -->]</ref> encoding problems


=for de:=
=for de:=

Version du 4 février 2008 à 19:08

Legend :

  • DumZiBoT behavior when fine
  • not-so-good title found by DumZiBoT

en

  • [1] - text/html for .pdf file (soft404 actually without redirect) 404 - page not found
  • [2] - text/html for .txt file OK
  • [3] - No type or length OK
  • [4] - application/pdf but my tool reports it as text/html (python issue?) media
  • [5] (url_info) — Cookies required redirect 404
  • [6] (url_info) — NY Times login OK
  • [7] (url_info) — NY Times login, long list of redirects 404
  • [8] (url_info) — Redirect to login page Login - :: cns news ::
  • [9] (url_info) — A false negative on my tool Resource secured
  • [10] (url_info) — Article text removed OK (a title is found, even if the article text is missing...)
  • [11] (url_info) — Soft 404 redirect 404
  • [12] (url_info) — Expired Google Cache 403
  • [13] (url_info) — Redirects to /index.html redirect to root
  • [14] (url_info) — Redirects to /err_404.html redirect 404

DumZiBoT previous problems

  • [15] not detected as a link
  • [16] OK, converted case
  • [17] sign in page
  • [18] loading
  • [19] 404 - page not found
  • [20] no archiving spiders allowed

Misc, current tests

for de:

...