2018年5月21日 星期一

crawling, bot,


Twitter bots may have altered the outcome of two of the world’s most consequential elections in recent years.
They played a small but potentially decisive role, researchers say.
BLOOMBERG.COM


An Internet Bot, also known as web robotWWW robot or simply bot, is a software application that runs automated tasks (scripts) over the Internet.[1] Typically, bots perform tasks that are both simple and structurally repetitive, at a much higher rate than would be possible for a human alone. The largest use of bots is in web spidering (web crawler), in which an automated script fetches, analyzes and files information from web servers at many times the speed of a human. More than half of all web traffic is made up of bots.[2]
Efforts by servers hosting websites to counteract bots vary. Servers may choose to outline rules on the behaviour of internet bots by implementing a robots.txt file: this file is simply text stating the rules governing a bot's behaviour on that server. Any bot interacting with (or 'spidering') any server that does not follow these rules should, in theory, be denied access to, or removed from, the affected website. If the only rule implementation by a server is a posted text file with no associated program/software/app, then adhering to those rules is entirely voluntary – in reality there is no way to enforce those rules, or even to ensure that a bot's creator or implementer acknowledges, or even reads, the robots.txt file contents. Some bots are "good" – e.g. search engine spiders – while others can be used to launch malicious and harsh attacks, most notably, in political campaigns.[2]




//透過網站爬蟲 (Web crawling) 技術,把創新及科技基金二十多年投資的五千多個科研項目數據從基金網站發掘出來和整理後,並以圖像方法展示數據。//

透過網站爬蟲 (Web crawling) 技術,把創新及科技基金二十多年投資的五千多個科研項...
THESTANDNEWS.COM

crawl

4[WITH OBJECT] Computing (Of a program) systematically visit (a number of web pages) in order to create an index of data:its automated software robots crawl websites, grabbing copies of pages to index

沒有留言:

張貼留言