uCoz Community » For Webmasters » Site Promotion » Indexing Policy & Robots.txt
Indexing Policy & Robots.txt
Sunny
Posts: 9296
Reputation: 456

Message # 1 | 10:41 AM
Website's Indexing Status


All uCoz websites have Indexing status that is displayed at the top of the Control Panel's main page (/panel/?a=cp). The parameter shows whether indexing by search engines is allowed for the website or not (whether the website is in quarantine).
The indexing status can show one of the two options: "indexing is allowed (quarantine is removed)":



Or "indexing is prohibited (the website is in quarantine)":



The status "indexing is prohibited (the website is in quarantine)" is assigned by default to all newly created websites.

Quarantine Removal Policy


A website can become available for indexing either automatically (if a premium plan is purchased) or upon the website owner's request. If the website does not have a premium plan and the user wants the quarantine to be removed, a request should be submitted from the website's Control Panel:



There will be a pop-up window with the info on the quarantine policy:



After the request has been submitted, the website will be checked automatically according to a number criteria: the website's age, presence of a custom domain name, content, verified phone number etc. On the basis of these criteria the system decides whether the quarantine should be removed. We cannot provide a more detailed description of the algorithm.

Note! If the quarantine removal was denied, the next request can be submitted no sooner than in 7 days.


Robots.txt


A website's robots.txt file is located at http://your_website_address/robots.txt. A website with the default robots.txt is indexed in the best possible way – we set up the file in such a way that only pages with content are indexed, and not all existing pages (e.g. login or registration page). Therefore uCoz websites are indexed better and get higher priority in comparison with other sites where all unnecessary pages are indexed.

That's why we strongly recommend not to replace the default robots.txt by your own.


If you still want to replace the file by your own, create a text file using Notepad or any other text editor and name it "robots.txt". Then upload it to the root folder of your website via File Manager or FTP. Note: while website indexing is prohibited, no modification of the robots.txt file is possible.

The default robots.txt looks as follows:
Quote

User-agent: *
Allow: /*?page
Allow: /*?ref=
Allow: /stat/dspixel
Disallow: /*?
Disallow: /stat/
Disallow: /index/1
Disallow: /index/3
Disallow: /register
Disallow: /index/5
Disallow: /index/7
Disallow: /index/8
Disallow: /index/9
Disallow: /index/sub/
Disallow: /panel/
Disallow: /admin/
Disallow: /informer/
Disallow: /secure/
Disallow: /poll/
Disallow: /search/
Disallow: /abnl/
Disallow: /*_escaped_fragment_=
Disallow: /*-*-*-*-987$
Disallow: /shop/order/
Disallow: /shop/printorder/
Disallow: /shop/checkout/
Disallow: /shop/user/
Disallow: /*0-*-0-17$
Disallow: /*-0-0-

Sitemap: http://forum.ucoz.com/sitemap.xml
Sitemap: http://forum.ucoz.com/sitemap-forum.xml



Robots.txt during the quarantine looks as follows:

Quote

User-agent: *
Disallow: /




Robots.txt FAQ


Informers are not indexed because they display information that ALREADY exists. As a rule this information is already indexed on the corresponding pages.


Question: I have accidentally messed up robots.txt. What should I do?

Answer: Delete it. The default robots.txt file will be added back automatically (the system checks whether a website has it, and if not – adds back the default file).


Question: Is there any use in submitting a website to search engines if the quarantine hasn't been removed yet?

Answer: No, your website won't be indexed while in quarantine.


Question: Will the robots.txt file be replaced automatically after the quarantine has been removed? Or should I update it manually?

Answer: It will be updated automatically.


Question: Is it possible to delete the default robots.txt?

Answer: You can't delete it, it's a system file, but you can add your own file. However, we don't recommend to do this, as was stated above. During the quarantine it is impossible to upload a custom robots.txt.


Question: What should I do to forbid indexing of the following pages?
_http://site.ucoz.com/index/0-4
_http://site.ucoz.com/index/0-5

Answer: Add the following lines to the robots.txt file:
/index/0-4
/index/0-5


Question: I have forbidden indexing of some links by means of robots.txt but they are still displayed. Why is it so?

Answer: By means of robots.txt you can forbid indexing of pages, not links.


Question: I want to make some changes in my robots.txt file. How can I do this?

Answer: Download it to your PC, edit it and then upload it back via File Manager or FTP.

I'm not active on the forum anymore. Please contact other forum staff.
nmrs
Posts: 5
Reputation: -2

Message # 121 | 7:00 PM
ok i have to wait 30 day sad
ok thanks guys
DEEPKG
Posts: 316
Reputation: 8

Message # 122 | 7:03 PM
nmrs, Good luck with Your Site till That u can work on your site design ....content .....many.........Things.....to do smile Bro Don't worry
I Try to help. U can Try to give Rep ++ For my try :P
ALexXL9345
Posts: 3
Reputation: 0

Message # 123 | 6:28 AM
hi all, can someone tell me why my robots ar Disallow www.watch-movies.ucoz.com/robots.txt on other website in ucoz the robots ar diferrent.....
Paradox
Old Guard
Posts: 3284
Reputation: 145

Message # 124 | 6:32 AM
ALexXL9345,

Quote (Sunny)
There is a "quarantine" for each new website when no modification of the robots.txt file is possible. In case of good traffic the quarantine will last up to 2 weeks, for the sites with low traffic – 30 days. If you pay for any of the additional services quarantine will end immediately after the payment.

Jack of all trades in development, design, strategy.
Working as a Support Engineer.
Been here for 13 years and counting.
ALexXL9345
Posts: 3
Reputation: 0

Message # 125 | 7:10 AM
when you say " good traffic" how many visits per day?
Paradox
Old Guard
Posts: 3284
Reputation: 145

Message # 126 | 7:32 AM
ALexXL9345, I am not sure on that number personally although I would hazard a guess at somewhere between 80-200 visits a day from what I've seen.
Jack of all trades in development, design, strategy.
Working as a Support Engineer.
Been here for 13 years and counting.
ALexXL9345
Posts: 3
Reputation: 0

Message # 127 | 10:03 AM
fnks
Xpress7866
Posts: 2
Reputation: 0

Message # 128 | 12:01 PM
Hello, i'm new here, and i am in need of some clarification regarding Robots.txt. I've read the Robots.txt thread in the section named Site Promotion, and about these subjects:

"There is a "quarantine" for each new website when no modification of the robots.txt file is possible."

"I have accidentally corrupted robots.txt. What should I do?
Delete it. Our robots.txt file will be added automatically (the system checks whether a user has it, and if not – adds back the default file)."

So i've modified the robots.txt file (meaning i added a new one in the admin page > File Manager), but because i thought that i made a mistake, i deleted it after a few days. That was a week ago.. Will my quarantine perioud be extended ( another 30 days after i modified the Robots.txt file)?

This is my site: http://xpresstuning.ucoz.com
Sunny
Posts: 9296
Reputation: 456

Message # 129 | 9:10 AM
Xpress7866, no, the quarantine period won't be extended.
I'm not active on the forum anymore. Please contact other forum staff.
Xpress7866
Posts: 2
Reputation: 0

Message # 130 | 12:00 PM
Ok. Thanks.
cyberworlds
Posts: 77
Reputation: 1

Message # 131 | 1:38 PM
i get message

Restricted by robots.txt 107


whats happen whit my robots.txt

nonames.at.ua/robots.txt

Sunny
Posts: 9296
Reputation: 456

Message # 132 | 1:58 PM
cyberworlds, where exactly do you get this message? Your robots.txt looks ok.
I'm not active on the forum anymore. Please contact other forum staff.
cyberworlds
Posts: 77
Reputation: 1

Message # 133 | 2:22 PM
Quote (Sunny)
cyberworlds, where exactly do you get this message? Your robots.txt looks ok.




how to fixed. please help me with robots.txt or meta taq

my meta taq

Code
<meta content='follow, all' name='Scooter'/>
<meta content='follow, all' name='msnbot'/>
<meta content='follow, all' name='alexabot'/>
<meta content='follow, all' name='Slurp'/>
<meta content='follow, all' name='ZyBorg'/>
<meta content='follow, all' name='Scooter'/>
<meta content='Global' name='Distribution'/>
<meta content='General' name='Rating'/>
<meta content='all,INDEX,FOLLOW,noodp,noydir' name='Robots'/>
<meta content='follow, all' name='Googlebot-Image'/>
Attachments: 9557484.jpg (47.0 Kb) · 5219254.jpg (145.4 Kb) · 1420870.jpg (37.4 Kb)

Post edited by cyberworlds - Monday, 2012-03-05, 2:25 PM
Sunny
Posts: 9296
Reputation: 456

Message # 134 | 2:53 PM
cyberworlds, there is nothing wrong with your robots. The restricted pages on your screenshot are pages, unnecessary for indexation, like the login and the registration page.

Quote (Sunny)
A website with the standard robots.txt is indexed in the best possible way. We adjusted it in such a way that only pages with content are indexed, and not all existing pages (e.g. login or registration page). Therefore uCoz websites are indexed better and get higher priority in comparison with other sites where all unnecessary pages are indexed.

I'm not active on the forum anymore. Please contact other forum staff.
donce
Posts: 41
Reputation: 0

Message # 135 | 8:52 PM
why when I search my robots.txt adress they show me this

User-agent: *
Disallow: /

What to do to google see my site?

http://filmofil.ucoz.com/
uCoz Community » For Webmasters » Site Promotion » Indexing Policy & Robots.txt
Search: