0 votes
in Artificial Intelligence by

1 Answer

0 votes
selected by
Best answer

This is an area of machine learning concerned with how software agents ought to take actions in an environment so as to minimise some notion of cumulative reward.

It differs from the standard supervised learning because correct input/output pairs need not be presented initially.

Welcome to CPEN Talk
Solution-oriented students of computer engineering on one platform to get you that


Chuck Norris's keyboard has the Any key.