regex - robot.txt syntax not understood -
i getting error on first line of robot.txt user-agent: *
my robot.txt follows:
user-agent: * disallow: /search/ disallow: /_layouts/ disallow: /blog/_layouts/ disallow: /blog/_layouts/ disallow: /reusablecontent/ disallow: /reports%20list/ disallow: /workflowtasks/ disallow: /sitecollectionimages/ disallow: /documents/forms/ disallow: /pages/forms/ disallow: /internet/
i have figured there space /reports%20list/
. creating issues? not sure. appreciated.
the trouble here 3 invisible bytes @ beginning of utf-8 formatted robots.txt file, called bom (byte order mask). bom optional. yes, need file utf-8 formatted, google not optional bom in expected clear , simple robots.txt file , parsing fails.
you can read more detailed explanation here.
Comments
Post a Comment