Comments
Transcript
Building an IBM Watson Solution IBM Watson Version 2
IBM Watson Version 2 Building an IBM Watson Solution IBM Confidential 23 January 2015 IBM Watson Version 2 Building an IBM Watson Solution IBM Confidential 23 January 2015 IBM Confidential Notes Before using this information and the product it supports, be sure to read the general information under Notices at the end of this information. Prerelease information disclaimer: This document contains proprietary information. All information contained herein shall be kept in confidence. None of this information shall be divulged to persons other than (a) IBM employees authorized by the nature of their duties to receive such information, or (b) individuals with a need to know in organizations authorized by IBM to receive this document in accordance with the terms (including confidentiality) of an agreement under which it is provided. This edition applies to Version 2 Release 3 of IBM Watson. This information applies to all subsequent releases and modifications until otherwise indicated in new editions. This edition replaces Version 2 Release 1. © Copyright IBM Corporation 2014, 2015. US Government Users Restricted Rights – Use, duplication or disclosure restricted by GSA ADP Schedule Contract with IBM Corp. IBM Confidential Contents Chapter 1. IBM Watson partner enablement . . . . . . . . . . . . . 1 Chapter 2. Helping to teach Watson . . . 5 Establishing user roles . . Building a Watson solution . . © Copyright IBM Corp. 2014, 2015 . . . . . . . . . . . . . . . 7 . 9 Evaluating questions and content resources. . . . 15 Notices . . . . . . . . . . . . . . 19 Trademarks . . . . . . Privacy policy considerations . . . . . . . . . . . . . . . 20 . 20 iii IBM Confidential iv IBM Watson Version 2: Building an IBM Watson Solution IBM Confidential Chapter 1. IBM Watson partner enablement Planning a strategy to enable and build an IBM Watson™ solution or integrating Watson™ into a solution entails a thorough understanding of the set of processes for designing, developing, and delivering a Watson solution. It includes an engagement process for working on the high level planning and scoping a Watson solution, tooling to help provide data that is used to train Watson, and a methodology for developing applications for users to Ask Watson questions or other types of requests, such as diagnosis or treatment advice. IBM Watson is a natural language question answering system. It determines its answers and associated confidence scores based on the knowledge it has acquired. Watson solutions provide a combination of the following key characteristics: v Understand questions through natural language Watson provides natural language processing to help understand the complexities of human communication both a means of user interaction and to extract knowledge from sources of data v Generate and evaluate hypotheses, answers and supporting evidence Watson provides hypothesis generation and evaluation by applying advanced analytics to weigh and evaluate a panel of responses based on only relevant evidence. Watson is an adaptable system that can be customized for specific application domains and questions. After defining use cases for a Watson solution, developers and domain experts can help train Watson for a domain-specific solution by supplying the content resources and the training data that is required. User interface developers can use the Watson SDK to create Watson enabled applications to support specific kinds of users, contexts, or use cases. Questions are submitted through a Watson enabled application to a Watson pipeline. The pipeline initiates the process for analyzing, searching, and returning information to a user. The information that is returned is based on the data in the knowledge base, analytics algorithms, and machine learning models. The question analysis involves searching and generating large amounts of data, and requires hardware and software resources to support a massively parallel computing environment. Watson use cases or solutions can be applied to different kinds of scenarios such as the following ones: v Ask - Users can ask Watson direct questions in natural language the same way they ask friends or colleagues questions. v Discover - Users can discover new insights or find the rationale for a given response with Watson. For example, a researcher might be looking for the best way to treat a disease for a specific group of patients. v Decide - Users might use Watson to help them decided on the best course of action. For example, a user might be looking for confidence-based recommendations for their next action when they have many options to chose from such as what course of treatment to prescribe to a patient or what investment choice to make. © Copyright IBM Corp. 2014, 2015 1 IBM Confidential Natural language processing Natural language is the real language that people use to naturally capture and communicate knowledge. In solving natural language questions, Watson must evaluate puns, slang, jargon, and acronyms to determine its confidence in returning an answer. Whereas a web search engine returns a ranked list of web pages based on keywords, Watson analyzes the structure and wording of questions being asked, and quickly formulates an answer that has the highest level of "confidence" is correct. A search engine becomes less accurate as more information is introduced to the search, but Watson is able to use additional information to become increasingly accurate and precise the more data it is given. Watson includes many algorithms to analyze natural language processing. It also has algorithms that enable it to learn, so it remembers what it learned to apply that information to the next analytical task it must conduct (for example, when a person says "fluid," the person can mean "liquid" as well). Unstructured data Unstructured information represents the largest, the most current, and the fastest growing source of information that is available to businesses and governments. By using Watson, you can create applications that can analyze large volumes of unstructured information to discover, organize, and deliver relevant knowledge to specific types decision makers. Currently, Watson uses textual data only. Unstructured data must be analyzed to interpret, detect, and locate concepts of interest that are not explicitly tagged or annotated in the original document. Documents can include any kind of domain-specific information that you define as such. Steps to building Watson solutions and services At a high level, there are different steps to building components in a Watson solution or service. For example, the following image shows some of the possible ways to be part of a Watson ecosystem: v Define a use case for which users submit questions to Watson for answers, advice, evidence, or information. 2 IBM Watson Version 2: Building an IBM Watson Solution IBM Confidential Have people provide questions that they expect users of the system to ask Watson. These questions will help drive and identify the content that needs to be made part of the Watson knowledge source. v Identify and upload content resources to form the knowledge source for Watson to answer questions. v Help train Watson for a specific domain by creating training data. v Configure the user interface of an Advisor application or an integration for users to ask Watson questions. The following image shows some of the possible workflows in a Watson ecosystem. Each hosted instance of Watson is made available for developers, subject matter experts, and administrators to perform tasks that can build or train components of a Watson solution or service. There are two different modes of operation for a Watson solution or service. Each of these modes is made available as its own hosted instance of Watson. v Customization mode is for developing a Watson solution or service. The Watson core is a domain-independent implementation that developers customize by adding new algorithms and providing relevant content that Watson can incorporate into its base of knowledge. Its ability to answer questions based on this customization can be verified and refined over time as the information changes or new analytics become available. Customization environments are where iterative system configuration, development, and testing of all components occurs. It includes the following kinds of tasks: Chapter 1. IBM Watson partner enablement 3 IBM Confidential – Uploading and ingesting content resources to form the knowledge source to answer questions. – Providing training data to help train a system. – Using the Watson SDK for developers to build Watson enabled applications for users to submit questions. UI developers can use the Watson SDK to build web or mobile applications that enable users to submit questions or can create Watson services and integrate them into existing applications or portals. Development tasks might also include customizing a data model, defining training data, creating and running experiments through questions and answers to train or test a system, analyzing test results to assess precision and accuracy, and creating or making improvements to system components. The goal in Watson system design and development is to deliver solutions that enable: – Effective analysis: Understand and process rich natural language questions over a broad domain of knowledge. – Precise answers: Determine what is being asked and give precise responses. – Accurate confidence: Determine likelihood that the answer is correct. – Consumable justifications: Explain why the answer is right. – Fast response times: Return answers with precision and confidence quickly. Actual response times will be different for each solution and based on several factors. v Production mode is for enabling users to submit questions and receive answers. Production environments are for systems that are deployed for users of solutions or services. The system could be configured for specific governance, security, or compliance requirements. The question answering might be for one person or for many concurrent users and questions. Answer-confidence is based on how the system is designed. For example, one solution might be for only returning a single answer and greater than 95% confidence and another solution could let a user specify the number of top answers with supporting evidence to return based on a confidence level the user can specify. 4 IBM Watson Version 2: Building an IBM Watson Solution IBM Confidential Chapter 2. Helping to teach Watson Help train and teach Watson by supplying content resources to add to its knowledge content and creating training data in the form of questions and answers. IBM Watson is a cognitive computing solution that relies on users to help train the system rather than explicitly programming it to run tasks. Watson can be improved through interactions with users and with the addition of data to the system. An IBM Watson solution brings together a set of transformational technologies to drive optimized outcomes: v Processes large amounts of data quickly. v Understands natural language. v Generates answers with a confidence level. v Returns information with a confidence level that is based on evidence that is found in content resources. v Adapts to the responses a user selects. To analyze and answer questions, Watson relies on: v Content resources, such as a set of documents, to be uploaded and used to form a knowledge store that Watson uses to find answers and supporting evidence to questions. Content resources are existing documents that contain knowledge about the domain topic (such as Product guidelines, specifications, or policies). These resources might be in the form of unstructured (natural language text) or structured (database) formats. Examples include a product manual or a table that describes different types of specifications. Ingestion refers to the processing of content resources such as a corpus of documents from which Watson finds answers, advice, or evidence for questions. The ingestion process includes natural language preprocessing of text corpora and the creation of derived data resources such as search indexes and a knowledge base. v Question and answer pairs that form the training data that is used to help train Watson to answer questions. Training data is a set of representative question and answer pairs that explicitly link expected input to expected output. That is, users who understand the target domain, create questions and link them to correct answers that are found in the uploaded content resources. Training data represents a sample for the intended system function. It is used to train and test the system performance. Watson benefits by improving the quality of the training data or adding more content. If more content resources are added to the system, then current question and answer pairs might need to be modified. Watson Experience Manager is a suite of tools that is provided in a customization environment to enable each of the following kinds of tasks: v Creating questions to help determine the target Watson solution. © Copyright IBM Corp. 2014, 2015 5 IBM Confidential The Question input tool in IBM Watson Experience Manager is for users to submit questions that represent the kinds of questions that users of the target solution might ask. v Adding content resources to form the knowledge content to answer questions. Reviewing representative questions can help identify the content resources that are needed to answer them. Content is added by using the Corpus management tool in IBM Watson Experience Manager. v Creating and managing the training data by creating the kinds of question and answer pairs that are similar to the questions that users would submit to Watson for answers. The process to create training data is to submit questions and specify one or more correct answers for each question. The answers must be available in the content resources that provide supporting evidence for the answers. IBM Watson Experience Manager provides an Expert training tool to manage this process. Questions that are submitted by users of the Question input tool appear in the Expert training tool and can be used as the initial list of questions to work with. Additional tasks: v Administering the user roles for controlling access to the tasks to perform in the tool to build knowledge content and training data. For example, user roles specify who can: – Submit questions – Add content – Create and deploy a corpus – Specify answers – Approve questions and answers v Configuring the user interface of the Advisor application for users of a production system to submit questions. Training Watson is trained much the way a human learns. It learns from historical question and answer pairs to understand how questions are asked and how the answers can be uncovered in the evidence. Watson then uses sophisticated machine learning algorithms to "get smarter" over time. Training data is the collection of questions and answers that are representative of questions that are asked by users of a Watson solution. Subject matter or domain experts create these representative questions and answers by using the Expert training tool in Watson Experience Manager. This training data will help Watson understand the language characteristics of the domain that Watson is operating on. This process is similar to what a person would follow if they were new to an environment, that is, practice by asking and answering some questions to become familiar with the way concepts are expressed. Reviewers and approvers of training data for a Watson project might assign a particular user to author a collection of documents. The collection might cover a subset of the complete corpus of documents that Watson uses to answer questions for a domain. As a domain expert, a user might be assigned a target number of questions that need answers. The goal in creating questions and answers is not to imagine every possible question and answer that a user might ask. Instead, the 6 IBM Watson Version 2: Building an IBM Watson Solution IBM Confidential goal is to build a representative set that enables Watson to understand the domain's language characteristics. Learning Just as people continue to learn after their formal education, Watson continues to learn from experience after it is deployed. In real life, people learn from feedback that they receive from others and from new material they read or ingest. Watson is no different. When a user confirms that one of the responses from Watson is correct, in particular, if that answer is one that Watson ranked second or third, and not the most likely answer, then Watson can use that feedback to learn. The system is able to capture these questions and correct answers and append them to the training set that was initially used to train Watson for the specific domain. It is not efficient for Watson to retrain on one question at a time. Watson batches new questions, and when the wanted threshold is met, a training process can be run to improve how Watson handles future questions. This cycle can continue on an ongoing basis so that Watson continues to perform more accurately. There are several variations on how to customize the way that Watson learns. For example: v Select users can be designated as Approvers. – Approvers can review the question answer pairs that are captured from users of the system to validate the correctness of the feedback. – Any feedback that is provided by these users would automatically be accepted for more learning. Feedback from all other users can either be reviewed, or discarded. – Approvers can either discard question answer pairs that are created through user feedback that are deemed to be incorrect or provide new answers to those questions. v Domain experts can improve training data by vetting questions where no correct answer was indicated. The cause for no correct answer might be because Watson did not have sufficient confidence to provide an answer or the user indicated that no correct answer was provided (by adding negative feedback and not selecting an alternative answer). For these kinds of questions, domain experts might identify the appropriate answer and add the question answer pair to the queue of new pairs. Questions deemed inappropriate by the reviewers or approvers would be deleted. Establishing user roles There are several different user roles and associated tasks for building a Watson solution. Start by first understanding and identifying the people who can fill the roles in the system development workflow. To apply the correct roles to your users, review the roles that are involved with Watson. Chapter 2. Helping to teach Watson 7 IBM Confidential Subject matter experts Subject matter experts (SME) help identify and upload content to form the knowledge content from which Watson finds answers. They also help create the training data in the form of question and answers. The subject matter expert has these responsibilities: v Understands the data resources and requirements for industry-specific and real-world question analysis and answering. v Understands the domain and the kinds of questions and answers for an actual solution. v Knows the terms and forms that users would typically use to ask questions. v Helps identify and upload the information that is needed to become part of the Watson knowledge content. The SME might also identify the types of domain adaptations (such as more content, training, and functional) for the question and answer data model. v Provides input to more resources for system usage and storage. These additional resources include domain-specific terms, words, and other information and features in the data, such as abbreviations, synonyms, acronyms, and semantic types. SMEs are domain experts who create question and answer pairs that form an answer key. The answer key is used to help train Watson so it is more familiar with a specific domain. After creating a question, the experts specify answers by selecting text from the knowledge content documents. The documents for SMEs to select answers from is based on a keyword search of the documents for terms in the question text. For example, if a SME submits the question, what is term insurance?, then the documents returned with potential answers are based on the keywords, in the question, such as term and insurance. At this stage of building a Watson solution, SMEs are providing training data to help Watson learn. However, they are not training Watson directly, nor is Watson answering the questions that are submitted. Rather, at this stage, when a SME submits a question, they see the results for the text passages searched and found across the content that was uploaded to this point. From the passage search results, the SME can specify answers by selecting one document at a time and highlighting text in it. Question and answer pairs are submitted by authors and reviewed by approvers. A subject matter expert might be an author, be an approver, or handle both tasks. Administrators Administrators are those people who are responsible for configuring the application and who have access to reports and metrics in IBM Watson Experience Manager. There are two types of administrator roles: v Customer administrators assign user roles and user permissions and configure the Advisor application that customers use to ask Watson questions. The user 8 IBM Watson Version 2: Building an IBM Watson Solution IBM Confidential role permissions establish what tasks that each user can perform in the supported tooling for building a solution. v System administrators configure the connection from user applications to a Watson pipeline. Administrators can monitor training progress and review content resources and generate usage reports to understand what content is being used by Watson to answer questions. User interface developers User interface developers can implement user applications by using the personalization page to customize the chat window for users, or integrating the application by using an iFrame. They can embed a Watson application within an existing application and initialize the chat window such that the right information is passed to it. They can also use the Watson Platform Software Development Kit to create applications by using the Question and Answer REST service and a set of JavaScript widgets. End users End users are the people who ask Watson questions by using the web and mobile interfaces of the Watson Advisor in a production environment. Users might also help provide representative questions and answers for training data, test the applications, and help teach Watson by providing feedback. Building a Watson solution Building a Watson solution is enabled by using a collection of supported tools and processes that a team of users performs to help train and teach Watson to become a domain-specific solution. IBM Watson Experience Manager is a suite of tools to help enable the process to build a Watson solution. Experience Manager provides tools for each of the different user roles to do their job. Watson Experience Manager is for domain and subject matter experts, administrators, managers, and user interface developers to train, customize, and monitor a Watson application. Watson Experience Manager applications are hosted and managed by the IBM® Cloud Technology and Support (WCTS) team , and available through a web interface. Each Watson instance is made available through a URL from WCTS. Watson Experience Manager includes tools that are visible to users based on their user roles that are assigned by an administrator: v Collecting representative questions Building a Watson Engagement Advisor solution begins with collecting representative questions. You need to collect representative questions from the intended users of the solution. The target Watson Engagement Advisor solution should be able to answer representative questions and thus the questions help drive the direction of the target Watson solution. This question set also helps you identify the content resources that need to be added to the Watson corpus to answer the questions. Subject matter experts should not create representative questions. Chapter 2. Helping to teach Watson 9 IBM Confidential A minimum of 2000 representative questions is required to help train Watson. A lower number of questions will not be enough to help achieve a high level of system performance. There are different methods to collect questions. Users can use the Question input tool that is part of Watson Experience Manager to submit questions. Questions that are submitted in the Question input tool are added directly to the Expert Training tool. Subject matter experts then use this tool to find answers to these new questions that need an answer. Then, the question answer pairs go through a review process before they can be approved and added as part of ground truth for training Watson. Representative questions can also be gathered from the Adaptive Learning tool to add questions that have been submitted by users of the Advisor user interface. Questions can also be collected in a spreadsheet and then imported into the tool. Adding more questions can help drive the creation or acquisition of more corpus content to be able to answer the questions. v Administering users for access to Watson Experience Manager System administration is for defining users and assigning user roles to them. Administrators can also generate user report metrics. Users see only the tools or options in the tools that they are authorized to view based on their assigned user roles. IBM system administrators can also use the tool to specify the connection to a Watson processing pipeline. v Creating a corpus Use the Corpus Management tool to add content to include as part of the Watson corpus. The corpus forms the answer store, or knowledge content, for Watson to find answers First, the corpus content required to answer the representative questions is identified. The content is then uploaded, preprocessed, indexed, and stored in the file system before it can be used to find and specify answers to questions in ground truth. v Creating or updating ground truth The Expert training tool is for creating question answer pairs that help teach Watson about your domain. These question answer pairs form the question store, and are also referred to as ground truth. After acquiring representative questions and creating a corpus, subject matter experts must match questions to similar questions or with answers. To answer questions, you must search and find, and then select the similar question of the correct answer. Searching for an answer is based on a keyword or filename search of the documents added to the Corpus management tool. Answers are specified by matching a question to a similar question, or to an answer unit. The possible answers are document segments that are formed by the content ingestion process. The Expert training tool provides support for managing a review and approval process for question answer pairs. Some users might be assigned the role to create question answer pairs while others are assigned to review and approve or reject pairs. v Configuring the user interface of the Advisor application Configuration is for configuring the look and feel of the Advisor application for users. 10 IBM Watson Version 2: Building an IBM Watson Solution IBM Confidential User interface developers or administrators customize the user interface for the Watson Advisor for users of the solution. They can configure the chat window that customers can use to ask Watson questions. UI developers can also design and implement Watson enabled applications or integrated services into an existing application for users to submit questions to Watson. v Testing the Advisor application The Test option in Watson Experience Manager is for testing the application that users see when the application is deployed to a production environment. The Advisor application is a chat window in which users can submit questions, view common questions, and view returned answers or responses. When asking questions, users should be aware that the wording of a question matters. How a questions is phrased can affect the answering results. For example, "what is", "what's", and "tell me about" are all ways to ask a question but Watson might not always return the same answer, nor with the same level of confidence for these variations, so it is good try variations to receive the best answer. For example, by asking "What is Watson?" there is little for Watson to analyze and to base its confidence scores on for a correct answer because "what is" and "Watson" are not enough domain-specific words to distinguish possible answers in corpus content with high confidence. Another example, of seeing different answers and confidence scores would be "How do you train a Watson solutions," versus "How do I train a Watson solutions," since "you" is ambiguous, and refers to Watson, not a user. v Improving accuracy through training and testing Ground truth is the collection of approved question and answer pairs in the Expert Training tool. Randomly generated subsets of ground truth are used to train and test the system. IBMers work with partners and customers to train and test a Watson Engagement Advisor solution: – They run training to train machine learning (ML) models to a specific domain. – They use the Watson Workbench to measure accuracy of a system by running a test experiment and analyzing the experiment results. This data shows accuracy, precision, and recall metrics and can help provide insight to what updates to the corpus or ground truth that might improve these accuracy metrics. Workflow for building a Watson solution The process for establishing, developing, and delivering a Watson solution involves three phases. Many of these steps are completed by working with IBM services representative and are not required in all instances. Services consultants and system administrators help configure and train a Watson solution by using the standard solution methodology that includes a set of best practices. Before users can submit questions to Watson from a Watson Advisor or chat window in a production environment, the system must first be trained and tested. From a tools and process perspective, training is making Watson more familiar with a particular domain. At a high level, the tasks to perform include the following: v Collect representative questions from users of the target solution. Chapter 2. Helping to teach Watson 11 IBM Confidential Collecting representative questions is the important first step drives the steps to build a Watson Engagement Advisor solution. Training a Watson solution starts by collecting representative questions that are submitted by the intended users of the target solution. It is important that the questions be questions that actual users of a target production system might ask. v Collect content for a corpus that provides answers to questions. Answers to questions must be available in the corpus. The answers to questions that reside in corpus documents must be correctly preprocessed to create meaningful document segments or answer units that subject matter experts will then be able to find and select to create the question answer pairs that are used to train and form the question store. The collection of question and answer pairs that have been reviewed and approved form the ground truth for a Watson solution. v Use the Corpus Management tool to add content that forms the corpus. The tool preprocesses documents to form answer units, or document segments, that are made available for mapping to questions by subject matter experts. The tool also allows corpus managers and administrators to view the answer units to be sure that documents have been preprocessed correctly. Some documents might need to be modified or a different document preprocessor might be needed. v Use the Expert Training tool to create a ground truth of approved question and answer pairs. – Find and match questions to similar questions or to answer units. – Match a question to similar questions forms a question cluster. Create question clusters to group questions that are paraphrases with the same answer. Clusters can help increase the accuracy and confidence of Watson to answer questions. – Find and match an answer to a question. Individual questions that are matched to an answer are known as singletons. A higher occurrence of singletons in a ground truth can lead to lower accuracy and confidence than by having question clusters. For example, a ground truth that includes multiple ways of asking the same question that are all part of a question cluster leads to better training for Watson. – Review and approve question and answer pairs. Only the Approved questions in the Expert Training tool form the ground truth that is used to train and test Watson. v Click the Create Corpus button in the Corpus Management tool to create a new question store and answer store, and to deploy the updates to the test environment to be able to test the system. v Test the current system from the Test user interface in Experience Manager, and then deploy to the production environment. v Run training to train the machine learning (ML) models to the specific domain for a solution. This step is currently performed by IBMers. Training is performed by using a random subset of the question and answer pairs in ground truth. The set is randomly generated one time and then used for each training run. v Run testing and perform accuracy and headroom analysis to measure current accuracy and identify changes to make to the corpus and ground truth based on the analysis to improve accuracy. 12 IBM Watson Version 2: Building an IBM Watson Solution IBM Confidential Testing is performed by running an experiment with the Watson Workbench. Testing is performed by using a random subset of the question and answer pairs in ground truth. The set is randomly generated one time and then used for each test experiment. v Continue iterating on this process to improve accuracy. Once a corpus is deployed to a production pipeline it can be made available for users of the Advisor interface to ask questions. At a high level, the process to enable a Watson solution involves the following phases: 1. Try Watson Use a sample Watson Advisor instance to ask Watson questions that are based on an initial set of uploaded documents. In this phase, you work with IBM Managed Services to complete the following steps: a. Upload an initial set of documents into a configuration mode instance of Watson. b. Assess the content to ensure an appropriate fit with Watson capabilities. c. Experiment by submitting sample questions by using the Watson Advisor. d. Optionally perform these tasks by using tools that are included in Watson Experience Manager: v Update the corpus by uploading more documents. v Customize the Watson Advisor user interface for users by adding logos, themes, and choosing other options. v View metrics that show ingestion results, question answering results, and recommendations for improvements to the solution. v Use the Watson SDK to create Watson applications for users by using the available APIs and JavaScript widgets. 2. Train Watson Use a Watson instance to build a domain-specific solution. a. Configure Watson. You can migrate the configuration and corpus from the trial instance or create a new configuration and upload new content and form a new corpus. b. Help build the Watson solution. 1) Collect representative questions. Have users of the intended solution to submit representative questions. Collect these questions and import them from a spreadsheet or have user enter the questions directly through the question input tool. The goal is to collect at least 2000 questions that represent questions that would be asked by users of the target system. 2) Create knowledge content by uploading documents. Use the corpus management tool in Watson Experience Manager to update the knowledge content by uploading more documents. Domain experts or content suppliers can help supply content resources by uploading them with Watson Experience Manager. The knowledge content is generated by an ingestion process for one or more sets of uploaded documents. You can also use the tool to delete content and view metrics on the ingestion process. 3) Create a ground truth to train Watson. Subject matter experts understand a specific domain and help train Watson by matching questions that are representative of what users Chapter 2. Helping to teach Watson 13 IBM Confidential might ask Watson to similar questions or to answers. These question and answer pairs are used to train Watson and establish a baseline system with initial levels of accuracy and precision. They can use the Expert Training tool in Watson Experience Manager to define, review, refine, and approve question and answer pairs. 4) Improve the quantity and quality of answers by refining ground truth or the corpus. By reviewing the ground truth to ensure that questions are grouped correctly and with correct and consistent answers subject matter experts can achieve higher levels of accuracy and precision. This step is an iterative process where incremental improvements can be achieved. 5) Configure the user interfaces of the Advisor application for the appropriate look and feel for the custom user experience of a specific solution. Use the configuration options in Watson Experience Manager to customize the Watson Advisor user interface for users by adding logos, themes, and choosing other options. User interface developers can integrate the Advisor into an existing application. They can also choose to design and implement other user interfaces or services by using the Watson SDK to add functions available in the Watson JavaScript toolkit and the Watson REST APIs. 6) Optionally: v Integrate a Watson Advisor into an existing environment. For example, you can create a more comprehensive business application by integrating a Watson application with existing information sources, process flows, and applications. v View metrics that show ingestion results, question answering results, training results, and recommendations for improvements to the solution. 3. Run a solution to a production environment for users to interact with Watson. A Watson instance can be provisioned to deploy the solution into a production environment. The run instance is where users can ask Watson questions through Advisors that can be made available through mobile and web user interfaces. Users can submit questions and provide feedback that is used to improve the Watson solution. Provision the run instance after all of the customization and testing is completed. The configuration of the train instance can be migrated to a run instance. v Deploy solution Integrate and deploy the solution to complete your business application. v Monitor use Monitor metrics and performance measures to understand system and content usage and performance and to look for opportunities for improvement. v Run solution Continue to update the knowledge content and training data to adapt to new or changing information and improve accuracy. 14 IBM Watson Version 2: Building an IBM Watson Solution IBM Confidential Evaluating questions and content resources To help ensure successful Watson Engagement Advisor solutions, start by understanding the kinds of questions it can answer and evaluating sample questions to see whether they are of a type that can be answered by Watson. Then, evaluate content resources to see whether they can be used by Watson for providing answers and evidence. You must also assure that the number and quality of question answer pairs that are gathered is adequate for training Watson. The following kinds of questions are currently supported: v Descriptive questions - Asking for a definition, explanation, or description. v Yes or No questions - Asking for a statement that indicates whether something is true or false. v Procedural how-to questions - Asking for the series of steps to accomplish a task. v Procedural troubleshooting questions - Asking for the series of steps to diagnose and resolve a problem. Watson Engagement Advisor cannot answer the following kinds of questions: v Predictions of the future unless predictions are in the content resources v Judgments v Requests for calculations v Requests for assimilation of information from various sources For more information: v “Guidelines for questions” v “Guidelines for content resources” on page 17 Guidelines for questions Use the following guidelines to evaluate the suitability of a specific solution. General v The answers must be found in available content resources. v Answers must be explicit in the content. Watson Engagement Advisor cannot synthesize answers. v Watson cannot yet combine information from various documents or create an answer that is a deduction from passages it finds. v Questions must not require Watson to make judgments. Descriptive questions v Questions can be approximately one to three sentences in length. Long paragraphs are not good questions. v Questions should require answers of a small paragraph (about three sentences) or less. Longer answers are allowed if the answer is a list that exists in the text. Descriptive question examples v Good Examples – Q: How much interest do I get from my Savings Deposit Program if I am on duty? Chapter 2. Helping to teach Watson 15 IBM Confidential A: Although the interest is taxable, the account pays a guaranteed interest rate of 10 percent during deployment. – Q: How do I find out about a DTAP briefing: A: Contact your local Transition Office to find out when a DTAP briefing is scheduled on your installation. If DTAP briefings are not available at your installation, the Transition Office Staff will refer you to other sources where similar information is available. v Bad Examples – Which insurance policy should I pick? Requires much information about the users and requires judgment. – How do I close this account? There needs to be enough context to know what the user is asking about. In this example, there needs to be information provided about which account. Yes or No questions v Same question length as Descriptive questions. v Need to provide enough contextual information for . Yes or No question examples v Good Examples – Q: As a renter, am I covered by my landlord's insurance policy? A: Landlords' policies typically cover the dwelling structure, contents they own and their liability. Their policy does not cover the goods or liability of a tenant. – Q: Can my EVP coverage be transferred if I sell my car? A: You can transfer the Extended Vehicle Protection plan if you sell your vehicle to a private purchaser. v Bad Examples – What is the best homeowner’s policy for me? Requires significant personalization information, rule evaluation. Currently, Watson uses limited information about the user and does not evaluate business rules. – Did the ABC policy cover flood damage in 1987? Watson cannot determine what was true at what time with a high degree of accuracy. Procedural questions v Same question length as Descriptive questions. v Answers are the length of a document section that documents a procedure. Procedural – How-to questions v Good Examples – Q: How do I change the ring tone on my iPhone? A: <Set of steps to change the ring tone.> – Q: How do you configure a new email provider in my Galaxy IIIS 16 IBM Watson Version 2: Building an IBM Watson Solution IBM Confidential A: <Set of steps to configure a new email provider> v Bad Examples – How can I make a million dollars? Must be answerable by specific information in a customer’s content. – How do I change my ring tone? - Watson cannot answer questions related to specific information about the user, such as what product they own, unless there is agreement to support specific client profile information. - The Watson Advisor dialog capability can address some missing information, but the support is limited. Procedural - Troubleshooting questions v Good Examples – Q: The email on my iPhone is frozen A: <Step by step procedure to determine why the email is frozen> – Q: I have a Galaxy S4 Why is my battery not lasting very long anymore? A: <Step by step procedure to determine why the battery is being drained> v Bad Examples – Q: My email is frozen As for How-to questions, Watson does not know what phone you are using for email, unless there is integration work to add that information to the question. – Q: I was charged too much last month. Watson does not address billing, sales, or account problems. Guidelines for content resources Evaluate content resources by following these guidelines: v Watson supports the following content types: HTM, HTML, PDF, DOC, DOCX, MHT, and ZIP. Within these documents, Watson can interpret Text and Tables. v v v v Watson cannot interpret Diagrams, Pictures, and other graphics, which are embedded video and audio, and Open form mathematical expressions. All content must be in English. US, UK, Canada, Australia, New Zealand can be supported if the questions are from the same locale. Content cannot be multilingual documents. Individual documents should not mix internal and public information. Unless the solution is an Agent-assist or internal-only use case. Internal content means information for inside a company use only. Documents should provide good context for the information they convey. Try to provide well-organized document structure with descriptive section headings. Watson can use some poorly structured documents, but if more than 25% are poorly structured, it is difficult to achieve good accuracy and precision. Watson supports finding answers from tables in documents, but these factors apply: Chapter 2. Helping to teach Watson 17 IBM Confidential – Tables can be complex and they are difficult to extract meaning from. – If documents are primarily tables, accuracy might be poor compared to other sources. – Content resources that include table intensive documents can be used. v Watson supports documents that are formatted in two columns, but with these notes: – Works correctly for HTML resources. – Some PDF files are difficult to interpret in multicolumn format. – Content resources that include many multicolumn PDF files can be used. 18 IBM Watson Version 2: Building an IBM Watson Solution IBM Confidential Notices This information was developed for products and services offered in the U.S.A. IBM may not offer the products, services, or features discussed in this document in other countries. Consult your local IBM representative for information on the products and services currently available in your area. Any reference to an IBM product, program, or service is not intended to state or imply that only that IBM product, program, or service may be used. Any functionally equivalent product, program, or service that does not infringe any IBM intellectual property right may be used instead. However, it is the user's responsibility to evaluate and verify the operation of any non-IBM product, program, or service. IBM may have patents or pending patent applications covering subject matter described in this document. The furnishing of this document does not give you any license to these patents. You can send license inquiries, in writing, to: IBM Director of Licensing IBM Corporation North Castle Drive Armonk, NY 10504-1785 U.S.A. For license inquiries regarding double-byte (DBCS) information, contact the IBM Intellectual Property Department in your country or send inquiries, in writing, to: Intellectual Property Licensing Legal and Intellectual Property Law IBM Japan, Ltd. 1623-14, Shimotsuruma, Yamato-shi Kanagawa 242-8502 Japan This information could include technical inaccuracies or typographical errors. Changes are periodically made to the information herein; these changes will be incorporated in new editions of the publication. IBM may make improvements and/or changes in the product(s) and/or the program(s) described in this publication at any time without notice. The following paragraph does not apply to the United Kingdom or any other country where such provisions are inconsistent with local law: INTERNATIONAL BUSINESS MACHINES CORPORATION PROVIDES THIS PUBLICATION "AS IS" WITHOUT WARRANTY OF ANY KIND, EITHER EXPRESS OR IMPLIED, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF NON-INFRINGEMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Some states do not allow disclaimer of express or implied warranties in certain transactions, therefore, this statement may not apply to you. IBM may use or distribute any of the information you supply in any way it believes appropriate without incurring any obligation to you. Licensees of this program who wish to have information about it for the purpose of enabling: (i) the exchange of information between independently created © Copyright IBM Corp. 2014, 2015 19 IBM Confidential programs and other programs (including this one) and (ii) the mutual use of the information which has been exchanged, should contact: IBM Corporation Bldg. 101/4th Floor/Dept. MU5A/Div F6 11301 Burnet Road Austin, TX 78758 U.S.A. Such information may be available, subject to appropriate terms and conditions, including in some cases, payment of a fee. The licensed program described in this document and all licensed material available for it are provided by IBM under terms of the IBM Customer Agreement, IBM International Program License Agreement, or any equivalent agreement between us. If you are viewing this information softcopy, the photographs and color illustrations may not appear. Trademarks IBM, the IBM logo, and ibm.com are trademarks or registered trademarks of International Business Machines Corp., registered in many jurisdictions worldwide. Other product and service names might be trademarks of IBM or other companies. A current list of IBM trademarks is available on the Web at Copyright and trademark information at http://www.ibm.com/legal/copytrade.shtml. Java and all Java-based trademarks and logos are trademarks or registered trademarks of Oracle and/or its affiliates. Other company, product, and service names may be trademarks or service marks of others. Privacy policy considerations This IBM Watson Software Offering does not use cookies or other technologies to collect personally identifiable information. This offering uses cookies only to track user sessions and to carry authorization tokens. If additional software that is deployed by you the customer or Business Partner, along with this Software Offering, provides you the ability to collect personally identifiable information from end users via cookies and other technologies, you should seek your own legal advice about any laws applicable to such data collection, including any requirements for notice and consent. For more information about the use of various technologies, including cookies, for these purposes, see IBM’s Privacy Policy at http://www.ibm.com/privacy and the section titled Cookies, Web Beacons, and Other Technologies in IBM’s Online Privacy Statement at http://www.ibm.com/privacy/details Also, see the IBM Software Products and Software-as-a-Service Privacy Statement at http://www.ibm.com/ software/info/product-privacy 20 IBM Watson Version 2: Building an IBM Watson Solution Product Number: IBM Confidential Printed in USA