Recently Updated Pages
Uniform-cost search
When actions have different costs, an obvious choice is to use best-first search where the evalua...
02 Data Wrangling Process
Data Wrangling is the process of transforming “raw” data into data that can be analyzed to gener...
03 Scraping
Web crawling, data crawling, and web scraping are all names for the process of data extrac-...
01 Introduction to API
Data ingestion: Data ingestion is the first and fundamental step of any Data Analysis Pipeline. Th...
02 Cassandra
Apache Cassandra is a highly scalable, high-performance distributed database designed to handle l...
04 A map of NoSQL technologies
Key-Value Store: A key that refers to a payload (actual content / data). E.g. MemcacheDB, Azure T...
Backtracking Search for CSPs
Sometimes we can finish the constraint propagation process and still have variables with multiple...
Introduction
A logical agent is an agent that is capable of using logical sentences to represent knowledge of ...
03 ER Exercises
Exercise 1: Design an ER Model for a car rental system that manages the customers, the cars and th...
Introduction
Factored representation for each state: a set of variables, each of which has a value. A problem ...
Stochastic games
Many games include unpredictable stochastic events, such as the throwing of dice in backgammon. We can ap...
Introduction
In this chapter we cover competitive environments, in which two or more agents have conflicting g...
Summary
Bidirectional search
Simultaneously searches forward from the initial state and backwards from the goal state(s), hopi...
Introduction
Preliminaries: A search algorithm takes a search problem as input and returns a solution, or an in...
Flume
Apache Flume is a distributed, reliable, and available software for efficiently collecting, aggre...
Sqoop
Sqoop is a command-line interface application for transferring data between relational databases ...
Storm
Apache Storm is a distributed stream processing computation framework. Storm provides realtime co...
Impala
Apache Impala is an open source massively parallel processing (MPP) SQL query engine for data sto...
Hive
Apache Hive is a data warehouse software project built on top of Apache Hadoop for providing data...