OnoGraph CSV/Excel Upload: Gender

Users can upload a CSV or Excel file containing names and be returned with a CSV that includes the probabilities of the names' gender. The data is the same as provided via the gender API.

Uploading a CSV or Excel file with the following information:

forename surname
Kari Lindkvist
Dayanara Bethancourth
Onyebuchi Anyanwu
Gopi Caughari
Goi Korowa
Ramzy El Mahadeen

will produce the following result:

forename surname world_gender world_percent world_sample jurisdiction_1_name jurisdiction_1_gender jurisdiction_1_percent jurisdiction_1_sample
Kari Lindkvist f 68 21,5176 India f 78 66,785
Dayanara Bethancourth f 98 2,997 Philippines f 100 1,257
Onyebuchi Anyanwu m 82 26,208 Nigeria m 82 26,137
Gopi Caughari m 88 128,242 India m 89 119,254
Goi Korowa m 56 1,558 Papua New Guinea m 78 625
Ramzy El Mahadeen m 85 15,517 Jordan m 88 7,593

The gender columns list either m or f for male and female; and the percent column lists the percentage chance someone with a name being that gender.

For example, in the above table 68% of people in the world with the name Kari are female; and 78% of people in India with the name Kari are female.

Instructions

Follow the below instructions to enrich your Excel and CSV files with gender data.

1) Begin an upload

Navigate to the upload page in your account and click the Drag and drop or browse file area. Alternatively you can drag a file to that area.

screenshot of the upload file area for OnoGraph
2) Select a file

Select a file from your computer and press Open.

screenshot of selecting a file to upload for OnoGraph

When it has uploaded you will be shown the following success message.

screenshot of a successful file upload

Refresh the page. Below the upload area the file will appear in a list with the status Pending Analysis. Analysis should be complete in around one minute for uncompressed small to medium sized files.

screenshot of an upload pending analysis
3) Configure

When analysis is complete the file's status will change to Configure. Under Actions, click Configure to configure your file for processing.

screenshot of an analysed upload

You will be redirected to a page that has several options to configure your file and shows a preview of your file. Ensure your file is displayed correctly - see the sections for controls and encoding.

screenshot of the configuration layout

You will find the following configuration options at the top of the page:

  • First Name Field - select the column that contains the first name

The above field will provide options for the column name (if your file contains a header), otherwise it will contain values from the first row of your file.

  • Jurisdiction - select a jurisdiction you would always like to be included in your results
  • Number of Jurisdictions - the number of jurisdictions to add to each row, which can be between 1 and 3 (if a jurisdiction is not set) and 4 (if a jurisdiction is set)

Entering 1 will output a result like:

forename surname jurisdiction_1_name jurisdiction_1_gender jurisdiction_1_percent jurisdiction_1_sample
John Hughes United States m 100 377,2318

While entering 2 will output a result like:

forename surname jurisdiction_1_name jurisdiction_1_gender jurisdiction_1_percent jurisdiction_1_sample jurisdiction_2_name jurisdiction_2_gender jurisdiction_2_percent jurisdiction_2_sample
John Hughes United States m 100 3,772,318 England m 100 553,238
  • Output Format - whether you would like jurisdictions to be added to one or multiple columns

The split option will output a result like:

forename surname jurisdiction_1_name jurisdiction_1_gender jurisdiction_1_percent jurisdiction_1_sample
John Hughes United States m 100 3,772,318

While merged will output a result like:

forename surname jurisdiction_1
John Hughes United States::m::100:3,772,318
  • Insert After Column - select the column to insert OnoGraph data after

For Column SeparatorColumn Enclosure and Character Encoding see the relevant sections.

  • Ignore first row of file (headers) - select to ignore the first row from processing if it contains the column names

For Sanitize names see the relevant section.

  • Trim first name - select to remove middle names from the first name, e.g. if the first name is John Peter the name will be processed as John; particles such as the La in La Tonya and the St in St John are preserved
  • Only results for jurisdiction - when selected data will only be added to rows that include the selected jurisdiction; credits will not be expended for empty rows

Once configured the page will look something like:

screenshot of a configured upload

The first row is faded as it was selected to be ignored, owing to it containing column names.

The column containing the first name has been highlighted and a title has been added above it.

A column with the text OnoGraph data inserted here shows the area where OnoGraph data will be inserted.

Press the Begin Processing button at the top right to queue your file for processing. This will show the following message:

screenshot of a successfuly configured file
4) Processing

Returning to the upload page your file's status will now be Pending Processing.

screenshot of an upload pending processing

Your file may begin processing from one minute after it was configured.

When processing begins its status will be marked as Processing and will show the percentage completed.

screenshot of a file during processing
5) Download

Once complete the Status will be marked as Complete. Under Actions select Download CSV to download your enriched file.

You should download the file at the earliest convenience, as files may be deleted after seven days.

screenshot of a compelte upload

The result will look something like:

example of an output CSV