OCI Document Understanding is an AI service that enables developers to extract text, tables, and other key data from document files through APIs and command-line interface tools.
data:image/s3,"s3://crabby-images/35ef6/35ef65eeb575330aff2ebad411aa59e4f99eea88" alt="ai ai"
In this blog, I will show you how to visualize invoices in Oracle Analytics using the pre-trained OCI Document Understanding model.
Prerequisites:
1. Upload sample invoices to an OCI Object Storage bucket
2. Create an Oracle Analytics Cloud instance in OCI
Steps:
1. Create a CSV file:
Once all the invoice images are in the bucket, create a CSV file that includes your bucket URL, something like the one below.
data:image/s3,"s3://crabby-images/5264d/5264d68cf323abae686569da5e4e38f0c2d82108" alt="ai2 ai2"
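If you have many invoices, the CSV file can be generated with a short script instead of being typed by hand. The following is only a minimal sketch: the column headers `Image Name` and `Bucket URL`, the helper name `write_invoice_csv`, and the example.com URLs are all assumptions for illustration, so replace them with the URLs of your own bucket objects.

```python
import csv
import io


def write_invoice_csv(rows, out):
    """Write (image_name, url) pairs as a two-column CSV to the file-like object `out`."""
    writer = csv.writer(out, lineterminator="\n")
    # Header names are assumptions; use whatever names you prefer.
    writer.writerow(["Image Name", "Bucket URL"])
    for image_name, url in rows:
        writer.writerow([image_name, url])


# Placeholder values: replace these with the real URLs from your OCI bucket.
rows = [
    ("invoice1.jpg", "https://example.com/my-bucket/invoice1.jpg"),
    ("invoice2.jpg", "https://example.com/my-bucket/invoice2.jpg"),
]

# Write to an in-memory buffer here so the result can be inspected.
buffer = io.StringIO()
write_invoice_csv(rows, buffer)
csv_text = buffer.getvalue()
print(csv_text)
```

To produce an actual file, pass a file opened with `open("invoices.csv", "w", newline="")` instead of the in-memory buffer.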
2. Register model in Oracle Analytics Cloud (OAC):
Go to the Analytics home page, click the three dots in the top right corner, select Register Model/Function, and then select OCI Document Understanding Models.
data:image/s3,"s3://crabby-images/0ba3e/0ba3e073a25b724c6a1cbbb396c1a84f616f5873" alt="ai3 ai3"
Once selected, create a new connection by clicking the Create connection button. I already have a connection, so I select that one.
data:image/s3,"s3://crabby-images/38c87/38c870b2aecb0c691c1e25b1e44702baa93f763e" alt="ai4 ai4"
On the next screen, select the model. I select Pretrained Document Key Value Extraction, then choose the compartment, the bucket (where we uploaded all the invoices), and the document type (I select invoices), and give the model a name.
data:image/s3,"s3://crabby-images/08829/08829a8f245d86018076ccf19b450cd82857886c" alt="ai5 ai5"
3. Apply AI Model to our images:
On the Analytics homepage, click on the Create button from the top right corner and click Data Flow.
data:image/s3,"s3://crabby-images/dd2e2/dd2e2a37102143c0cbff3ce4a2f2c484b3b15e1e" alt="ai6 ai6"
On this screen, we have to choose the CSV file we created in Step 1 and click add.
data:image/s3,"s3://crabby-images/a500a/a500a92092b512aa2122a19880768ad09e887ab1" alt="ai7 ai7"
You will see the screen below.
data:image/s3,"s3://crabby-images/a7a44/a7a44213f7c36a16494fa5609da3f1e8d6e61ab1" alt="ai8 ai8"
Now click the + sign, click Apply AI Model, select the model we registered in step 2, and click OK.
data:image/s3,"s3://crabby-images/4444b/4444b75fe74eca9a449b2aaad2daac2522128a87" alt="ai9 ai9"
Then scroll down and, under Parameters, select Bucket URL as the column.
data:image/s3,"s3://crabby-images/f2fd8/f2fd83c8c906464f416bf9db2b5c8355b423ea60" alt="ai10 ai10"
Now click the + sign again and select Save Data. The saved data should contain the image names and URLs as well as the extracted key values (text and numbers).
data:image/s3,"s3://crabby-images/b4979/b4979ab87a26fe28f5e9a79700578ed9a1126097" alt="ai11 ai11"
After a few minutes, you will see all the data. The data flow loads the images, analyzes them with the pre-trained AI model, and extracts key values from them (in this example, from the invoices).
data:image/s3,"s3://crabby-images/099e6/099e623aebe0096ad3bfec2306def608fdaeb458" alt="ai12 ai12"
The data flow loads all the information into a dataset that we can use to explore and visualize.
4. Now Visualize the Dataset:
From the Oracle Analytics home page, click Create again and then click Workbook.
data:image/s3,"s3://crabby-images/21535/21535dcdb42efd9b1aef22ab237324d43f91f2bd" alt="ai13 ai13"
Click on the new dataset generated by the data flow in the previous step and add it to the workbook.
data:image/s3,"s3://crabby-images/c504a/c504a08c6a0af84544b005ecc523f39545289a85" alt="ai14 ai14"
Now, from Visualizations, add the custom visualization plugin object to see all the images in your bucket.
For how to download and use a custom visualization, see my other blog on the custom plugin.
data:image/s3,"s3://crabby-images/fbc84/fbc849e5b4e888b057d4a1eb9f1bf81a4e8e26dc" alt="ai15 ai15"
Create a new table to show all the columns of your dataset, or only the few columns you need, something like the one below.
data:image/s3,"s3://crabby-images/ff095/ff095f9103c80cbd1d50dddcc8adbf7834c445f2" alt="ai16 ai16"
I use the image object as a filter, so when I click on any image, the data for that invoice is shown on the right side.
data:image/s3,"s3://crabby-images/a979d/a979d9ee30b7754dda3497ec545c8a76acfd5a5e" alt="ai17 ai17"
This is cool! You can do many other things, such as showing only invoices where the invoice total is more than $1000.
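In the workbook, this kind of filter is applied through the UI. If you export the dataset to CSV, the same idea can be sketched in a few lines of Python. This is only an illustration: the column names `Image Name` and `Invoice Total`, the helper names, and the sample rows are assumptions, and the columns your model returns may be named differently.

```python
import csv
import io
from decimal import Decimal, InvalidOperation


def parse_amount(text):
    """Turn a string such as '$1,250.00' into a Decimal; return None if it is not a number."""
    cleaned = text.replace("$", "").replace(",", "").strip()
    try:
        return Decimal(cleaned)
    except InvalidOperation:
        return None


def invoices_over(rows, threshold, total_column="Invoice Total"):
    """Return the rows whose total is strictly greater than `threshold`."""
    selected = []
    for row in rows:
        # Treat a missing or empty total as "not a number" and skip the row.
        amount = parse_amount(row.get(total_column) or "")
        if amount is not None and amount > threshold:
            selected.append(row)
    return selected


# Made-up sample data standing in for an exported dataset.
sample = io.StringIO(
    "Image Name,Invoice Total\n"
    'invoice1.jpg,"$1,250.00"\n'
    "invoice2.jpg,$980.50\n"
    "invoice3.jpg,\n"
)

large = invoices_over(csv.DictReader(sample), Decimal("1000"))
for row in large:
    print(row["Image Name"], row["Invoice Total"])
```

`Decimal` is used rather than `float` so that currency amounts compare exactly.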
You can also store the dataset result in an Oracle Autonomous Database for further analysis.