Jump to content

User:לערי ריינהארט/tests/google bugzilla:707

From Wikipedia, the free encyclopedia

ro:User:Gangleri/tests/google bugzilla:707

  1. google:Johann Wolfgang von Goethe is
    at ro: http://www.google.com/search?q=Johann_Wolfgang_von_Goethe
    at en: http://www.google.com/search?q=Johann_Wolfgang_von_Goethe
  2. google:Johann-Wolfgang-von-Goethe is
    at ro: http://www.google.com/search?q=Johann-Wolfgang-von-Goethe
    at en: http://www.google.com/search?q=Johann-Wolfgang-von-Goethe
  3. google:Gerhard Schröder is
    at ro: http://www.google.com/search?q=Gerhard+Schr%C3%B6der
    at en: http://www.google.com/search?q=Gerhard_Schr%F6der
  4. google:Gerhard-Schröder is
    at ro: http://www.google.com/search?q=Gerhard-Schr%C3%B6der
    at en: http://www.google.com/search?q=Gerhard-Schr%F6der
  5. http://www.google.com/search?num=100&lr=&newwindow=1&q=+%22Johann+Wolfgang+von+Goethe%22

preliminary

[edit]
  • Working with different variations of gogle links and using these in templates I found some very special kinds of parametrisation:
  1. "bordeaux" "" "" used in
    http://www.google.com/search?newwindow=1&num=100&q=+%22bordeaux%22+%22%22+%22%22
  2. site:.Wikipedia.org -site:en.Wikipedia.org -site:fr.Wikipedia.org -site:de.Wikipedia.org -site:nl.Wikipedia.org "bordeaux" "" "" used in
    http://www.google.com/search?newwindow=1&num=100&q=+site:.Wikipedia.org+-site:en.Wikipedia.org+-site:fr.Wikipedia.org+-site:de.Wikipedia.org+-site:nl.Wikipedia.org+%22bordeaux%22+%22%22+%22%22
  3. Johann-Wolfgang-von-Goethe used in
    http://www.google.com/search?q=Johann-Wolfgang-von-Goethe
    It seams to be an equivqlent for
  4. "Johann Wolfgang von Goethe" used in
    http://www.google.com/search?&q=%22Johann+Wolfgang+von+Goethe%22
    also equivalent for the use of as_epq in
  5. http://www.google.com/search?as_epq=Johann+Wolfgang+von+Goethe


  • I have never read google specification It is long time since I used boolean edxpressions at http://www.leit.is/ another serch engine. Before we take advantage of this documented or undocumented features we should know about if they are stable. Who can look at this?
  • The "features" mentioned above are used at ro:Template talk:Lincuri/google/selectiv. I agree they are complex - some may say they are complicated ;-) I used similar at templates at eo: for a working list to find easily lots of pages from all wikipedia generating "odd" links see bugzilla:1512.


  • Whatever is used in [[XXXX:YYYY:ZZZZ]] is translated diffrently both for the special pages, for titles (articles, discussion pages etc). at a Latin-1 wiki as en: and a UTF-8 type wiki as ro:.
  1. Both templates work the same if all XXXX:YYYY:ZZZZ consists of ascii characters only.
  2. First differences can be seen if characters as ÄÖÜäöüß ÀÁ etc. are contained XXXX:YYYY:ZZZZ.
  3. Due to the fact that UTF-8 characters are stored as &amp#nnnn; on Latin-1 type wikis XXXX:YYYY:ZZZZ will translate to this and will generate wrong (in order not to call it useless) code for example google links. Other links generated by the parameter refering to titles will fail, but this is a known fact.

What would be the best solution ...

[edit]

... for references to [[google:YYYY]] other wikis and also for parametrisation of templates?

  • As Brion mentioned at bugzilla:707#comment_text_6 "One might add a field to the interwiki table listing it, or perhaps some funky alternate replacement: currently the interwiki URLs contain $1 as a placeholder for the link with underscores; another placeholder for spaces perhaps?" there are some alternatives: "We" (hopefully there is an interet on this) should evaluate if $2 and other values could be used for a subset of the following requirements:
  1. to use / to generate UTF-8 code (or equivalent translations) regardsless of what type the wiki is
  2. translate spaces in "-" if this would be the the most suitable for google
    alternatively to other characters if there is a requirement / a real need for it
  3. to use lowercase letters only (this would allow the usage of the same function at eo: too)

more ideas

[edit]

feedback - comments

[edit]
  • ...