OnoGraph CSV/Excel Upload: Nationality

Users can upload a CSV or Excel file containing names and be returned with a CSV that includes the probabilities of the names' nationality. The data is the same as provided via the nationality API.

Uploading a CSV or Excel file with the following information:

forename surname
John Barnes
Maria Valderrama
Siphesihle Ndlovu
Jianguo Qiang
Sandrine Wamytan
Waleed Shehadeh

will produce the following result:

forename surname jurisdiction_1 jurisdiction_1_percent jurisdiction_2 jurisdiction_2_percent jurisdiction_3 jurisdiction_3_percent
John Barnes United States 67.05682 England 14.71494 Australia 6.24271
Maria Valderrama Colombia 32.89143 Peru 17.91407 Venezuela 11.16194
Siphesihle Ndlovu South Africa 51.42918 Zimbabwe 45.03121 Swaziland 1.5223
Jianguo Qiang China 99.26973 Singapore 0.18147 Taiwan 0.17562
Sandrine Wamytan New Caledonia 97.22222 France 1.85185 French Polynesia 0.74074
Waleed Shehadeh Syria 33.00787 Jordan 25.12671 Palestine 13.62667

Instructions

Follow the below instructions to enrich your Excel and CSV files with nationality data.

1) Begin an upload

Navigate to the upload page in your account and click the Drag and drop or browse file area. Alternatively you can drag a file to that area.

screenshot of the upload file area for OnoGraph
2) Select a file

Select a file from your computer and press Open.

screenshot of selecting a file to upload for OnoGraph

When it has uploaded you will be shown the following success message.

screenshot of a successful file upload

Refresh the page. Below the upload area the file will appear in a list with the status Pending Analysis. Analysis should be complete in around one minute for uncompressed small to medium sized files.

screenshot of an upload pending analysis
3) Configure

When analysis is complete the file's status will change to Configure. Under Actions, click Configure to configure your file for processing.

screenshot of an analysed upload

You will be redirected to a page that has several options to configure your file and shows a preview of your file. Ensure your file is displayed correctly - see the sections for controls and encoding.

screenshot of the configuration layout

You will find the following configuration options at the top of the page:

  • First Name Field - select the column that contains the first name
  • Surname Field - select the column that contains the surname
  • Second Surname Field - select the column that contains the second surname (for Spanish and Portuguese names)

The above fields will provide options for the column name (if your file contains a header), otherwise they will contain values from the first row of your file.

  • Number of Nationalities To Include in Output - the number of nationalities to add to each row, which can be between 1 and 250

Entering 1 will output a result like:

forename surname jurisdiction_1 jurisdiction_1_percent
John Hughes United States 46.15556

While entering 3 will output a result like:

forename surname jurisdiction_1 jurisdiction_1_percent jurisdiction_2 jurisdiction_2_percent jurisdiction_3 jurisdiction_3_percent
John Hughes United States 46.15556 England 14.16929 Australia 4.02602
  • Output Format - whether you would like nationalities to be added to one or multiple columns

The split option will output a result like:

forename surname jurisdiction_1 jurisdiction_1_percent
John Hughes United States 46.15556

While merged will output a result like:

forename surname jurisdiction_1
John Hughes United States::46.15556
  • Insert After Column - select the column to insert OnoGraph data after

For Column SeparatorColumn Enclosure and Character Encoding see the relevant sections.

  • Ignore first row of file (headers) - select to ignore the first row from processing if it contains the column names

For Sanitize names see the relevant section.

  • Trim first name - select to remove middle names from the first name, e.g. if the first name is John Peter the name will be processed as John; particles such as the La in La Tonya and the St in St John are preserved
  • Trim surname - select to remove multiple surnames from the surname and second surname, e.g. if the surname is is dos Santos Teixeira the surname will be processed as dos Santos; particles such as the van der in van der Sar and the Abd El in Abd El Rahaman are preserved

Once configured the page will look something like:

screenshot of a configured upload

The first row is faded as it was selected to be ignored, owing to it containing column names.

The columns containing first name and surname have been highlighted and titles have been added above the columns.

A column with the text OnoGraph data inserted here shows the area where OnoGraph data will be inserted.

Press the Begin Processing button at the top right to queue your file for processing. This will show the following message:

screenshot of a successfuly configured file
4) Processing

Returning to the upload page your file's status will now be Pending Processing.

screenshot of an upload pending processing

Your file may begin processing from one minute after it was configured.

When processing begins its status will be marked as Processing and will show the percentage completed.

screenshot of a file during processing
5) Download

Once complete the Status will be marked as Complete. Under Actions select Download CSV to download your enriched file.

You should download the file at the earliest convenience, as files may be deleted after seven days.

screenshot of a compelte upload

The result will look something like:

example of an output CSV