Speaker: Cherry Prommawin
TL;DR
- Explains how a search engine works
- Beneficial for those working in SEO
Steps of Result Generation
crawling
- Discover new URLs on the internet
- If you add a new sub-page or sub-domain, it is important to link to it from pages that are already indexed
- If a page has multiple internal links pointing to it, the bot checks that page more often
- Crawler: the program that does the collecting
- a.k.a. Googlebot
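
To illustrate how internal links let a crawler discover new URLs, here is a minimal sketch that extracts same-host links from one page. It is not how Googlebot is implemented; `LinkCollector`, `internal_links`, and the sample HTML are hypothetical names and data made up for this example.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse


class LinkCollector(HTMLParser):
    """Collects the href value of every <a> tag on a page."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def internal_links(page_url, html):
    """Return absolute same-host URLs found in html, in page order, without duplicates."""
    collector = LinkCollector()
    collector.feed(html)
    host = urlparse(page_url).netloc
    found = []
    for href in collector.hrefs:
        absolute = urljoin(page_url, href)  # resolve relative links
        if urlparse(absolute).netloc == host and absolute not in found:
            found.append(absolute)
    return found


# Hypothetical page content, for illustration only.
sample_html = """
<a href="/new-page">New page</a>
<a href="https://example.com/about">About</a>
<a href="https://other.example.org/">External</a>
<a href="/new-page">New page again</a>
"""
found = internal_links("https://example.com/", sample_html)
print(found)
```

A page that is linked from an already-indexed page shows up in `found`, which is why a new sub-page with no internal links is hard for a crawler to reach.
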
- Factors that affect crawling
- loading speed
- content quality
- potential server errors
- other signals
- How to make the crawler crawl more (or less)?
- LESS: return HTTP status 500, 503, or 429
- MORE: avoid returning errors, improve site quality
- make the site one that people consider important
- use popular pages for internal links
- How to keep certain pages from being crawled?
- robots.txt
- the robots meta tag in the HTML head
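
As a rough illustration of how a robots.txt rule is evaluated, the sketch below uses Python's standard `urllib.robotparser`. The rule set and the URLs are assumptions made up for this example, and Googlebot's own parser may differ in details.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content: block everything under /private/ for all bots.
robots_txt = """\
User-agent: *
Disallow: /private/
"""

parser = RobotFileParser()
parser.parse(robots_txt.splitlines())  # parse from memory instead of fetching

allowed_public = parser.can_fetch("Googlebot", "https://example.com/blog/post")
allowed_private = parser.can_fetch("Googlebot", "https://example.com/private/report")

print("blog post crawlable:", allowed_public)
print("private report crawlable:", allowed_private)
```

The meta tag variant lives in the page itself, for example `<meta name="robots" content="noindex">`. Note that a bot has to fetch the page to see that tag, so it governs what happens after the fetch, while robots.txt is checked before fetching.
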
- Process of crawling
- Fetching and rendering
indexing
- Definition: decide whether to save a crawled page's information into the database
- Process
- Parsing the HTML
- Understanding the content: understand the page and compute signals
- How many keywords should each page contain? → More is not better
- Meta tag → the keywords meta tag is not used
- Will images be understood by Googlebot?
- Use a text attribute: put descriptive text in the alt attribute of img
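
A quick way to audit this on your own pages is to list images that lack alt text. The following is a minimal sketch using only the standard library; `MissingAltFinder` and the sample markup are hypothetical, made up for this example.

```python
from html.parser import HTMLParser


class MissingAltFinder(HTMLParser):
    """Records the src of every <img> whose alt attribute is missing or empty."""

    def __init__(self):
        super().__init__()
        self.missing = []

    def handle_starttag(self, tag, attrs):
        if tag != "img":
            return
        attributes = dict(attrs)
        alt = attributes.get("alt")
        # An empty alt can be intentional for purely decorative images;
        # this sketch flags it anyway so a human can review it.
        if not alt or not alt.strip():
            self.missing.append(attributes.get("src", "(no src)"))


# Hypothetical markup, for illustration only.
sample_html = """
<img src="cat.jpg" alt="A grey cat sleeping on a sofa">
<img src="banner.png">
<img src="spacer.gif" alt="">
"""
finder = MissingAltFinder()
finder.feed(sample_html)
print(finder.missing)
```
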
- Canonical: avoid duplication by clustering
- Group duplicate pages into a cluster
- That is, establish one standard (canonical) page per cluster
- rel="canonical" link in the head
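
The clustering idea can be sketched as grouping URLs by the canonical URL each page declares. This is a simplified illustration, assuming the declared tag is always trusted; a real search engine may weigh other signals when picking the canonical page. `CanonicalFinder`, `cluster_by_canonical`, and the sample pages are hypothetical.

```python
from collections import defaultdict
from html.parser import HTMLParser


class CanonicalFinder(HTMLParser):
    """Finds the href of the first <link rel="canonical"> on a page."""

    def __init__(self):
        super().__init__()
        self.canonical = None

    def handle_starttag(self, tag, attrs):
        if tag == "link" and self.canonical is None:
            attributes = dict(attrs)
            rel = (attributes.get("rel") or "").lower().split()
            if "canonical" in rel and attributes.get("href"):
                self.canonical = attributes["href"]


def cluster_by_canonical(pages):
    """Group page URLs under their declared canonical URL (or themselves if none)."""
    clusters = defaultdict(list)
    for url, html in pages.items():
        finder = CanonicalFinder()
        finder.feed(html)
        clusters[finder.canonical or url].append(url)
    return dict(clusters)


# Hypothetical pages: two URLs serve the same shoes page, one page has no tag.
pages = {
    "https://example.com/shoes":
        '<head><link rel="canonical" href="https://example.com/shoes"></head>',
    "https://example.com/shoes?ref=ad":
        '<head><link rel="canonical" href="https://example.com/shoes"></head>',
    "https://example.com/hats": "<head><title>Hats</title></head>",
}
clusters = cluster_by_canonical(pages)
for canonical, members in clusters.items():
    print(canonical, "<-", members)
```
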
- Index selection
- Choose good-quality pages based on the bot's understanding of them
- How to know whether a site is indexed?
- Search for site:example.com in the search bar
- Using Search Console is more accurate
serving (providing search results)