詳解移除網址 - 第 3 集:移除不屬於您的內容
透過集合功能整理內容
你可以依據偏好儲存及分類內容。
2010 年 4 月 20 日,星期二
歡迎閱讀網址移除系列網站文章第 3 集!在第 1 集和第 2 集中,我們談論了如何加速移除由您控管的內容和要求加速移除快取內容。今天,我們來談談如何使用 Google 的公開網址移除工具,要求將不屬於您控管的網站上的內容從 Google 搜尋結果中移除。
Google 提供兩項可以加速移除內容的工具:
-
已驗證網址移除工具:如果您是網站管理員工具中的已驗證擁有者 (例如網誌或貴公司網站),當內容在您的網站上發布時,您就可以透過這項工具要求將內容從 Google 搜尋結果中移除。
-
公開網址移除工具:如果內容是發布在其他人的網站 (例如好友的網誌) 上,您就可以透過這項工具要求將內容從 Google 搜尋結果中移除。
有時候會發生這種情況:您想要移除的資訊來自非您所擁有或不是您控管的網站。由於每個網站管理員都會控管自家網站和網站內容,因此更新或移除 Google 搜尋結果的最佳做法,就是讓網站擁有者 (內容發布網站) 封鎖檢索網址、修改內容來源,或是完全移除網頁。如果內容並未變更,則下次檢索時,這些內容就會出現在 Google 搜尋結果中。因此,如要移除非自己網站上代管的內容,第一步就是與該網站擁有者聯絡,要求對方移除或封鎖相關內容。
已移除或封鎖的內容
如果網站擁有者移除了網頁,則針對已移除的網頁所提出的要求應該會傳回 404 Not Found
回應或 410 Gone
回應。如果網站擁有者選擇禁止搜尋引擎檢索該網頁,該應該在網站的 robots.txt 檔案中封鎖該網頁,或是家上一個 noindex
meta
標記。符合上述其中一項條件後,您可以透過「網站管理員已封鎖這個網頁」選項提交移除要求。
有時候,網站擁有者會聲稱自己已經封鎖或移除了某個網頁,但嚴格來說,卻未這麼做。如果對方聲稱某個網頁已到遭封鎖,您可以仔細檢查網站的 robots.txt 檔案,確認該網頁至否已列為封鎖網頁。
User-agent: *
Disallow: /blocked-page/
另一個檢查網頁是否遭到封鎖的地方,就是網頁的 HTML 原始碼本身。
您可以透過瀏覽器前往該頁面並選擇「查看網頁原始碼」,看看 HTML head
部分是否有中繼 noindex 標記?
<html>
<head>
<title>blocked page</title>
<meta name="robots" content="noindex">
</head>
...
如果網站擁有者告知您網頁已經移除,您可以使用 HTTP 回應測試工具 (例如 Firefox 瀏覽器的 Live HTTP Headers 外掛程式) 來確認網頁是否確實已經移除。啟用這個外掛程式後,您可以在 Firefox 瀏覽器中要求任何網址,測試該 HTTP 回應確實是 404 Not Found
或 410 Gone
。
已從網頁中移除的內容
一旦確認要移除的內容已經不再顯示於網頁上後,可以使用「已將內容從網頁中移除」選項來要求移除快取。這類移除方式通常稱為「快取」移除,可以確保 Google 的搜尋結果不會包含快取副本或舊網頁版本,或是任何來自舊版網頁的文字摘要。只有目前的更新網頁 (不含已移除的內容) 可以透過 Google 搜尋結果存取。不過,由於來自外部網站的導入連結仍然存在,因此目前的更新網頁可能仍然會根據舊版內容中的字詞進行排名。如果要求移除快取,系統會要求您輸入一個「已從網頁中移除的字詞」。請務必輸入目前實際網頁中不再顯示的字詞,這樣一來,我們的自動化程序才能確認網頁已經變更,否則系統會拒絕要求。「詳解移除網址」系列第 2 集詳細說明了快取移除作業。
移除 Google 安全搜尋篩選結果中顯示的不當網頁或圖片
Google 推出了安全搜尋篩選器,旨在提供可排除可能令人反感內容的搜尋結果。如果您發現您認為安全搜尋功能應過濾掉的內容,可以在日後要求將這類內容從安全搜尋結果中排除。透過「安全搜尋篩選結果出現不當內容」選項提交移除要求。
如果您在使用公開網址移除工具時遇到任何問題,或是在這裡找不到問題的解答,請將問題張貼到網站管理員說明論壇,或是前往我們的說明中心參閱更多詳細的移除操作說明。如果您在論壇中張貼問題,請記得使用網址縮短程序服務分享任何您想移除的內容連結。
此系列的其他文章
最後,我們也建議您參閱這篇文章,瞭解如何管理網路上與您有關的資訊。
作者:網站管理員趨勢分析師 Jonathan Simon
除非另有註明,否則本頁面中的內容是採用創用 CC 姓名標示 4.0 授權,程式碼範例則為阿帕契 2.0 授權。詳情請參閱《Google Developers 網站政策》。Java 是 Oracle 和/或其關聯企業的註冊商標。
[null,null,[],[[["\u003cp\u003eThis post focuses on Google's public URL removal tool, which helps remove content from search results when it's on a site you don't control.\u003c/p\u003e\n"],["\u003cp\u003eBefore using the tool, contact the website owner and request they remove or block the content, ensuring they return a 404/410 response or use robots.txt/noindex meta tag.\u003c/p\u003e\n"],["\u003cp\u003eIf content is removed from the page but still appears in search results, request a cache removal using the tool to update Google's index.\u003c/p\u003e\n"],["\u003cp\u003eYou can use the tool to request removal of inappropriate content appearing in SafeSearch filtered results.\u003c/p\u003e\n"],["\u003cp\u003eFor issues or questions, refer to the Webmaster Help Forum or detailed removal instructions in Google's Help Center.\u003c/p\u003e\n"]]],[],null,["# URL removal explained, Part III: Removing content that you don't own\n\nTuesday, April 20, 2010\n\n\nWelcome to the third episode of our URL removals series! In episodes one and two, we talked about\n[expediting the removal of content that's under your control](/search/blog/2010/03/url-removal-explained-part-i-urls)\nand\n[requesting expedited cache removals](/search/blog/2010/04/url-removals-explained-part-ii-removing).\nToday, we're covering how to use Google's\n[public URL removal tool](https://www.google.com/webmasters/tools/removals)\nto request removal of content from Google's search results when the content originates on a\nwebsite not under your control.\n\nGoogle offers two tools that provide a way to request expedited removal of content:\n\n1. Verified URL removal tool: for requesting to remove content from Google's search results when it's published on a site of which you're a verified owner in Webmaster Tools (like your blog or your company's site)\n2. Public URL removal tool: for requesting to remove content from Google's search results when it's published on a site which you can't verify ownership (like your friend's blog)\n\n\nSometimes a situation arises where the information you want to remove originates from a site that\nyou don't own or can't control. Since each individual webmaster controls their site and their\nsite's content, the best way to update or remove results from Google is for the site owner (where\nthe content is published) to either block crawling of the URL, modify the content source, or\nremove the page altogether. If the content isn't changed, it would just reappear in our search\nresults the next time we crawled it. So the first step to remove content that's hosted on a site\nyou don't own is to\n[contact the owner of the website](https://www.google.com/support/webmasters/bin/answer.py?answer=9109)\nand request that they remove or block the content in question.\n\nRemoved or blocked content\n--------------------------\n\n\nIf the website owner removes a page, requests for the removed\npage should return a\n[`404 Not Found` response or a `410 Gone` response](https://en.wikipedia.org/wiki/List_of_HTTP_status_codes).\nIf they choose to block the page from search engines, then the page should either be disallowed\nin the site's\n[robots.txt](/search/docs/crawling-indexing/robots/intro)\nfile or contain a\n[`noindex` `meta` tag](/search/docs/crawling-indexing/block-indexing).\nOnce one of these requirements is met, you can submit a removal request using the \"Webmaster has\nalready blocked the page\" option.\n\n\nSometimes a website owner will claim that they've blocked or removed a page but they haven't\ntechnically done so. If they claim a page has been blocked you can double check by looking at the\nsite's robots.txt file to see if the page is listed there as disallowed. \n\n```\nUser-agent: *\nDisallow: /blocked-page/\n```\n\n\nAnother place to check if a page has been blocked is within the page's HTML source code itself.\nYou can visit the page and choose \"View Page Source\" from your browser. Is there a meta noindex\ntag in the HTML `head` section? \n\n```\n\u003chtml\u003e\n\u003chead\u003e\n\u003ctitle\u003eblocked page\u003c/title\u003e\n\u003cmeta name=\"robots\" content=\"noindex\"\u003e\n\u003c/head\u003e\n...\n```\n\n\nIf they inform you that the page has been removed, you can confirm this by using an HTTP response\ntesting tool like the\n[Live HTTP Headers](https://addons.mozilla.org/en-US/firefox/addon/3829)\nadd-on for the Firefox browser. With this add-on enabled, you can request any URL in Firefox to\ntest that the HTTP response is actually `404 Not Found` or `410 Gone`.\n\nContent removed from the page\n-----------------------------\n\n\nOnce you've confirmed that the content you're seeking to remove is no longer present on the page,\nyou can request a\n[cache removal](https://www.google.com/support/webmasters/bin/answer.py?answer=59819)\nusing the 'Content has been removed from the page' option. This type of\nremoval---usually called a \"cache\" removal---ensures that Google's search results will not\ninclude the cached copy or version of the old page, or any\n[snippets](/search/docs/appearance/snippet)\nof text from the old version of the page. Only the current updated page (without the content\nthat's been removed) will be accessible from Google's search results. However, the current updated\npage can potentially still rank for terms related to the old content as a result of inbound links\nthat still exist from external sites. For cache removal requests you'll be asked to enter a \"term\nthat has been removed from the page.\" Be sure to enter a word that is not found on the current\nlive page, so that our automated process can confirm the page has changed---otherwise the\nrequest will be denied. Cache removals are covered in more detail in\n[part two of the \"URL removal explained\" series](/search/blog/2010/04/url-removals-explained-part-ii-removing).\n\nRemoving inappropriate webpages or images that appear in our SafeSearch filtered results\n----------------------------------------------------------------------------------------\n\n\nGoogle introduced the\n[SafeSearch](https://www.google.com/support/websearch/bin/answer.py?answer=510)\nfilter with the goal of providing search results that exclude potentially offensive content. For\nsituations where you find content that you feel should have been filtered out by SafeSearch, you\ncan request that this content be excluded from SafeSearch filtered results in the future. Submit\na removal request using the 'Inappropriate content appears in our SafeSearch filtered results'\noption.\n\n\nIf you encounter any issues with the public URL removal tool or have questions not addressed here,\nplease post them to the\n[Webmaster Help Forum](https://support.google.com/webmasters/community/label?lid=5489e59697a233d7)\nor consult the more\n[detailed removal instructions](https://www.google.com/support/webmasters/bin/answer.py?answer=164734)\nin our Help Center. If you do post to the forum, remember to use a\n[URL shortening service](https://www.google.com/search?q=url+shorteners)\nto share any links to content you want removed.\n\nOther posts of this series\n--------------------------\n\n- [Part I: Removing URLs and directories](/search/blog/2010/03/url-removal-explained-part-i-urls)\n- [Part II: Removing and updating cached content](/search/blog/2010/04/url-removals-explained-part-ii-removing)\n- [Part III: Removing content you don't own](/search/blog/2010/04/url-removal-explained-part-iii-removing)\n- [Part IV: Tracking requests, what not to remove](/search/blog/2010/05/url-removal-explained-part-iv-tracking)\n\n\nFinally, you might be also interested to read about\n[managing what information is available about you online](/search/blog/2009/10/managing-your-reputation-through-search).\n\n\nWritten by\n[Jonathan Simon](/search/blog/authors/jonathan-simon), Webmaster Trends Analyst"]]