uCoz Community » For Webmasters » Site Promotion » Indexing Policy & Robots.txt
Indexing Policy & Robots.txt
Sunny
Posts: 9296
Reputation: 456

Message # 1 | 10:41 AM
Website's Indexing Status


All uCoz websites have Indexing status that is displayed at the top of the Control Panel's main page (/panel/?a=cp). The parameter shows whether indexing by search engines is allowed for the website or not (whether the website is in quarantine).
The indexing status can show one of the two options: "indexing is allowed (quarantine is removed)":



Or "indexing is prohibited (the website is in quarantine)":



The status "indexing is prohibited (the website is in quarantine)" is assigned by default to all newly created websites.

Quarantine Removal Policy


A website can become available for indexing either automatically (if a premium plan is purchased) or upon the website owner's request. If the website does not have a premium plan and the user wants the quarantine to be removed, a request should be submitted from the website's Control Panel:



There will be a pop-up window with the info on the quarantine policy:



After the request has been submitted, the website will be checked automatically according to a number criteria: the website's age, presence of a custom domain name, content, verified phone number etc. On the basis of these criteria the system decides whether the quarantine should be removed. We cannot provide a more detailed description of the algorithm.

Note! If the quarantine removal was denied, the next request can be submitted no sooner than in 7 days.


Robots.txt


A website's robots.txt file is located at http://your_website_address/robots.txt. A website with the default robots.txt is indexed in the best possible way – we set up the file in such a way that only pages with content are indexed, and not all existing pages (e.g. login or registration page). Therefore uCoz websites are indexed better and get higher priority in comparison with other sites where all unnecessary pages are indexed.

That's why we strongly recommend not to replace the default robots.txt by your own.


If you still want to replace the file by your own, create a text file using Notepad or any other text editor and name it "robots.txt". Then upload it to the root folder of your website via File Manager or FTP. Note: while website indexing is prohibited, no modification of the robots.txt file is possible.

The default robots.txt looks as follows:
Quote

User-agent: *
Allow: /*?page
Allow: /*?ref=
Allow: /stat/dspixel
Disallow: /*?
Disallow: /stat/
Disallow: /index/1
Disallow: /index/3
Disallow: /register
Disallow: /index/5
Disallow: /index/7
Disallow: /index/8
Disallow: /index/9
Disallow: /index/sub/
Disallow: /panel/
Disallow: /admin/
Disallow: /informer/
Disallow: /secure/
Disallow: /poll/
Disallow: /search/
Disallow: /abnl/
Disallow: /*_escaped_fragment_=
Disallow: /*-*-*-*-987$
Disallow: /shop/order/
Disallow: /shop/printorder/
Disallow: /shop/checkout/
Disallow: /shop/user/
Disallow: /*0-*-0-17$
Disallow: /*-0-0-

Sitemap: http://forum.ucoz.com/sitemap.xml
Sitemap: http://forum.ucoz.com/sitemap-forum.xml



Robots.txt during the quarantine looks as follows:

Quote

User-agent: *
Disallow: /




Robots.txt FAQ


Informers are not indexed because they display information that ALREADY exists. As a rule this information is already indexed on the corresponding pages.


Question: I have accidentally messed up robots.txt. What should I do?

Answer: Delete it. The default robots.txt file will be added back automatically (the system checks whether a website has it, and if not – adds back the default file).


Question: Is there any use in submitting a website to search engines if the quarantine hasn't been removed yet?

Answer: No, your website won't be indexed while in quarantine.


Question: Will the robots.txt file be replaced automatically after the quarantine has been removed? Or should I update it manually?

Answer: It will be updated automatically.


Question: Is it possible to delete the default robots.txt?

Answer: You can't delete it, it's a system file, but you can add your own file. However, we don't recommend to do this, as was stated above. During the quarantine it is impossible to upload a custom robots.txt.


Question: What should I do to forbid indexing of the following pages?
_http://site.ucoz.com/index/0-4
_http://site.ucoz.com/index/0-5

Answer: Add the following lines to the robots.txt file:
/index/0-4
/index/0-5


Question: I have forbidden indexing of some links by means of robots.txt but they are still displayed. Why is it so?

Answer: By means of robots.txt you can forbid indexing of pages, not links.


Question: I want to make some changes in my robots.txt file. How can I do this?

Answer: Download it to your PC, edit it and then upload it back via File Manager or FTP.

I'm not active on the forum anymore. Please contact other forum staff.
Natashko
Posts: 3366
Reputation: 171

Message # 76 | 1:59 PM
Torres, Robots.txt file is a system file. If you still want to substitute it by your own, create a text file using notepad or any other text editor and name it "robots.txt". Then upload it to the root folder of your site by means File Manager or FTP.
khen
Posts: 475
Reputation: 13

Message # 77 | 3:37 PM
Why do ucoz disallow the search link (Disallow: /search) in robots.txt?
I want to allow google search engine to index search results of my website, is this beneficial to my site? or does it harm?

Natashko
Posts: 3366
Reputation: 171

Message # 78 | 3:59 PM
khen, search engines do index your website (its entries etc). The only thing, which is not indexed are so-called "service" pages (like registration page)
khen
Posts: 475
Reputation: 13

Message # 79 | 5:51 AM
Natashko, why do Disallow: /search is not allowed to be indexed or visited by a google bot?

Added (2011-01-19, 11:51 PM)
---------------------------------------------
up...up...up to be seen.


Natashko
Posts: 3366
Reputation: 171

Message # 80 | 2:31 PM
khen, "Disallow: /search" doesn't allow search page indexation. Just like I said before, "service" pages (like registration page and search page) are not indexed) It is done to make the indexation of other page (with content) go faster
fame
Posts: 3
Reputation: 0

Message # 81 | 11:24 PM
help me please how to know that site is 30days old, sothat i an edit robots.txt file. ???

http://glazayevrope.ucoz.com/


http://canecorsoturkey.ucoz.com/
Post edited by fame - Monday, 2011-02-21, 11:28 PM
Natashko
Posts: 3366
Reputation: 171

Message # 82 | 2:37 PM
fame, please, read the first post carefully. Address of the robots.txt file is http://your_website_address/robots.txt In this way you will be able to check whether the quarantine is over or not.
If you still want to unblock the robots.txt file for your website to be indexed sooner, you can pay for any of premium packages in the website Control Panel -> Top Bar -> $ -> Paid services.
Please see the screenshot: http://faq.ucoz.com/screenshots/paid_service_packages.png
khen
Posts: 475
Reputation: 13

Message # 83 | 1:38 PM
My question is already answered. Thanks... just find it a while ago.
Post edited by khen - Wednesday, 2011-02-23, 1:39 PM
adinbatsin
Posts: 6
Reputation: 0

Message # 84 | 10:32 AM
Hi I am from Turkey do not speak English "google translate" to write from my site / robots.txt file, "User-agent: * Disallow: /" in google and seen way prevented from entering? To delete the contents of the file on ftp standing Creating new file does not change I know understands the language, explain the reason for this is? Help me please

Added (2011-02-24, 4:32 Am)
---------------------------------------------
I have read the entire forum Certainly active after 30 days? This time frame varies traffics? google blocking traffic does not occur? surprised surprised

note: I do not know English and I turned to google wacko wacko

Natashko
Posts: 3366
Reputation: 171

Message # 85 | 1:30 PM
adinbatsin, please, provide a website name for us to be able to check.
adinbatsin
Posts: 6
Reputation: 0

Message # 86 | 7:26 PM
You want your web site name like telling http://mywep.ucoz.com/robots.txt see I have not uploaded your own lies at this .. I can not delete also (
fame
Posts: 3
Reputation: 0

Message # 87 | 10:24 PM
adinbatsin robot.txt dosyasını kendin yüklediysen sil ve bbir gün bekle 30 günün dolduysa sistem kendisi robot dosyası koyuyor
http://canecorsoturkey.ucoz.com/
adinbatsin
Posts: 6
Reputation: 0

Message # 88 | 8:14 PM
fame tskler (thank you) türkmüsün ? kardes

benim 30 gün daha dolmadi 30 gün sonra kendisi otamatik yüklüyormu tam onu anlamamistim siteyi hangi gün actıgımı tam hatırlamıyorum ama bekleriz bi 30 gün daha... tskler cevapin icin

Animorph
Posts: 2856
Reputation: 189

Message # 89 | 8:24 PM
adinbatsin, please talk english otherwise nobody can help you
To busy building a passive income online ;)
adinbatsin
Posts: 6
Reputation: 0

Message # 90 | 9:21 PM
Animorph, I'm sorry,
uCoz Community » For Webmasters » Site Promotion » Indexing Policy & Robots.txt
Search: