id	summary	reporter	owner	description	type	status	priority	milestone	component	resolution	keywords	cc	parents
1047	Server error when URL contains weird characters	ayleph		"My log files show a bot requesting a specific URL that causes a server error:
{{{'ascii' codec can't encode character u'\ufeff' in position 14: ordinal not in range(128)}}}
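
A minimal sketch of what appears to be happening, written for Python 3 rather than the Python 2.7 shown in the traceback. It assumes the path is percent-decoded as UTF-8 and then encoded with the filesystem encoding, which the error message suggests was ASCII on this server. The names raw_path, try_encode, ascii_result and utf8_result are hypothetical and exist only for illustration.

```python
from urllib.parse import unquote

# Path from the REQUEST_URI in the report, without the /mediagoblin script prefix.
raw_path = '/u/stwaldman/%7C%EF%BB%BFSEO'

# Percent-decoding (UTF-8 by default) yields a pipe followed by U+FEFF,
# the byte order mark character.
path = unquote(raw_path)
bom_index = path.index('\ufeff')


def try_encode(text, encoding):
    # Hypothetical helper: return the encoded bytes, or the
    # UnicodeEncodeError instance if the codec cannot represent the text.
    try:
        return text.encode(encoding)
    except UnicodeEncodeError as exc:
        return exc


# Assumption: the server's filesystem encoding was ASCII, as the
# 'ascii' codec message in the report implies.
ascii_result = try_encode(path, 'ascii')

# With a UTF-8 encoding the same path encodes without error.
utf8_result = try_encode(path, 'utf-8')

print(bom_index)
print(repr(ascii_result))
print(utf8_result)
```

Under these assumptions the ASCII attempt fails at the U+FEFF character, while the UTF-8 attempt round-trips to the same bytes that appear in PATH_INFO.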

{{{
Error - <type 'exceptions.UnicodeEncodeError'>: 'ascii' codec can't encode character u'\ufeff' in position 14: ordinal not in range(128)
URL: https://goblinrefuge.com/mediagoblin/u/stwaldman/%7C%EF%BB%BFSEO
File '/path/to/mediagoblin/lib/python2.7/site-packages/Paste-1.7.5.1-py2.7.egg/paste/exceptions/errormiddleware.py', line 144 in __call__
  app_iter = self.application(environ, sr_checker)
File '/path/to/mediagoblin/lib/python2.7/site-packages/Paste-1.7.5.1-py2.7.egg/paste/urlmap.py', line 203 in __call__
  return app(environ, start_response)
File '/path/to/mediagoblin/mediagoblin/app.py', line 268 in __call__
  return self.call_backend(environ, start_response)
File '/path/to/mediagoblin/lib/python2.7/site-packages/Werkzeug-0.9.6-py2.7.egg/werkzeug/wsgi.py', line 567 in __call__
  cleaned_path = cleaned_path.encode(sys.getfilesystemencoding())
UnicodeEncodeError: 'ascii' codec can't encode character u'\ufeff' in position 14: ordinal not in range(128)


CGI Variables
-------------
  DOCUMENT_ROOT: '/path/to/mediagoblin/mediagoblin'
  GATEWAY_INTERFACE: 'CGI/1.1'
  HTTPS: 'on'
  HTTP_ACCEPT: 'text/html,text/plain,text/xml,text/*,application/xml,application/xhtml+xml,application/rss+xml,application/atom+xml,application/rdf+xml'
  HTTP_ACCEPT_LANGUAGE: 'en'
  HTTP_CONNECTION: 'keep-alive'
  HTTP_HOST: 'goblinrefuge.com'
  HTTP_USER_AGENT: 'Mozilla/5.0 (compatible; MJ12bot/v1.4.5; http://www.majestic12.co.uk/bot.php?+)'
  PATH_INFO: '/u/stwaldman/|\xef\xbb\xbfSEO'
  PATH_TRANSLATED: '/path/to/mediagoblin/mediagoblin/u/stwaldman/|\xef\xbb\xbfSEO'
  REDIRECT_STATUS: '200'
  REMOTE_ADDR: '127.0.0.1'
  REMOTE_PORT: '58734'
  REQUEST_METHOD: 'GET'
  REQUEST_URI: '/mediagoblin/u/stwaldman/%7C%EF%BB%BFSEO'
  SCRIPT_FILENAME: '/path/to/mediagoblin/mediagoblin/mediagoblin'
  SCRIPT_NAME: '/mediagoblin'
  SERVER_PROTOCOL: 'HTTP/1.1'
  SERVER_SOFTWARE: 'lighttpd/1.4.35'
}}}"	defect	closed	major	0.9.0	programming	invalid	url,ascii,encode,error		
