Raw Record Source

{
  "$type": "site.standard.document",
  "description": "An update process updates relationship defining data by inputting, to a predetermined update map, a state of a vehicle obtained by a state obtaining process, a value of an action variable used to operate an electronic device, and a reward corresponding to an operation of an electronic device. A…",
  "path": "/patents/1288441",
  "publishedAt": "2021-04-22T00:00:00.000Z",
  "site": "at://did:plc:oql6ds5vnff4ugar6rruliwd/site.standard.publication/3mn3ohu7oxx5w",
  "tags": [
    "B60W10/06",
    "TOYOTA JIDOSHA KABUSHIKI KAISHA"
  ],
  "textContent": "An update process updates relationship defining data by inputting, to a predetermined update map, a state of a vehicle obtained by a state obtaining process, a value of an action variable used to operate an electronic device, and a reward corresponding to an operation of an electronic device. A range in which an operation process uses, as the action variable, a value different from a value that maximizes an expected return related to the reward is defined as a return non-maximizing range. In a case in which a degree of deterioration of the vehicle is greater than or equal to a predetermined degree, a changing process changes the return non-maximizing range to a side on which the return non-maximizing range is expanded as compared to a case in which the degree of deterioration is less than the predetermined degree.",
  "title": "VEHICLE CONTROLLER, VEHICLE CONTROL SYSTEM, VEHICLE LEARNING DEVICE, VEHICLE LEARNING METHOD, AND MEMORY MEDIUM"
}