Is the writing I submit used to train AI?

No. Your writing is used only to run the simulation and is never used to train models. This is stated in our Terms of Service, and immediately after each analysis you choose whether to keep or delete the original.

Can't I just ask ChatGPT to "review this from the perspective of a woman in her 30s"?

You can, but the result isn't the same. ChatGPT imagines a single hypothetical reviewer, a 'Korean woman in her 30s', from within its training distribution. Ilkim draws N readers according to the Statistics Korea (KOSIS) population distribution, so even among women in their 30s they vary in occupation, region, education, and interests. You see the distribution, not the average. ChatGPT also tends to rate the writing you show it favorably. Ilkim's personas aren't designed to flatter: on a dull paragraph they simply drop off, and that drop-off is what gets recorded.

ChatGPT is free — is there really a reason to pay?

To mimic 30 readers with ChatGPT, you'd prompt it 30 times and tally the answers into an average by hand. It takes over an hour, and you still don't get a distribution. Ilkim runs 30 personas at once in a single click and organizes the drop-off points, completion rate, and segment-level responses for you. What you save isn't money — it's time.
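As a rough illustration of what that tallying involves, here is a minimal Python sketch that turns per-reader results into a completion rate, drop-off points, and segment-level scores. The `Reading` schema, the `summarize` helper, and the sample data are all hypothetical and are not Ilkim's actual data model.

```python
from collections import Counter, defaultdict
from dataclasses import dataclass
from statistics import mean
from typing import Optional


@dataclass
class Reading:
    """One simulated reader's result (hypothetical schema for illustration)."""
    segment: str               # e.g. an occupation group
    score: int                 # rating on a 1-10 scale
    dropped_at: Optional[int]  # paragraph number where the reader quit; None if finished


def summarize(readings):
    """Aggregate individual readings into distribution-level figures."""
    finished = [r for r in readings if r.dropped_at is None]
    drop_points = Counter(r.dropped_at for r in readings if r.dropped_at is not None)
    scores_by_segment = defaultdict(list)
    for r in readings:
        scores_by_segment[r.segment].append(r.score)
    return {
        "completion_rate": len(finished) / len(readings),
        "drop_points": dict(drop_points),
        "segment_mean_score": {s: mean(v) for s, v in scores_by_segment.items()},
    }


# Made-up sample data: six readers across three segments.
readings = [
    Reading("office worker", 8, None),
    Reading("office worker", 6, None),
    Reading("self-employed", 3, 1),
    Reading("self-employed", 4, 1),
    Reading("student", 9, None),
    Reading("student", 5, 3),
]
summary = summarize(readings)
print(summary)
```

With real output from 30 separate prompts, the time goes into collecting the answers and normalizing them into a structure like this before any counting can start.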

How closely do the personas' responses match real readers?

We target at least 70% similarity, as judged in beta-user evaluations. During the beta, we share cases in which simulation results were checked against the actual comments and reactions on already-published writing.

Does this replace focus groups or surveys?

It fills the stage before them rather than fully replacing them. Outsourced quantitative research typically costs a few hundred to several thousand dollars and takes 2–4 weeks. Ilkim shows segment-level responses quantitatively in 90 seconds, right before you publish, so you can narrow down which hypotheses are worth researching, or settle smaller decisions without commissioning a study at all.

Can it analyze writing in English or Japanese, not just Korean?

For now it's specialized for Korean content and Korean readers, because the dataset follows the Statistics Korea distribution. Global expansion needs separate data infrastructure — it's on the roadmap, but the Korean market is the priority.

Our company's security policy makes it hard to send our writing to an external API. What then?

On-premises and private deployment options for in-house enterprise content teams are available under the Enterprise plan. We provide a package that runs in your internal GPU environment, with no processed data transferred outside it.

If I upgrade from Free to PRO, is my data kept?

Yes. All projects and analysis history are preserved. Downgrading is also possible — on downgrade only what exceeds the monthly limit is deactivated, and no data is deleted.


Why Asking ChatGPT to 'Review This as a 30-Something Woman' Misleads You

June 12, 2026 · 3 min read · Ilkim Team

Before publishing, when you want to know "how will this read," a lot of people now ask ChatGPT something like: "Evaluate this as a 30-something woman would." You get a fast, plausible answer. But there's a structural trap hiding in that prompt.

A general LLM imagines one average person

Ask ChatGPT to "review this as a 30-something woman," and the model conjures a single, average image of the label "30-something woman" and performs it. The problem: real 30-something women are not one person.

Within the same age and gender, occupation, region, education, interests, and free time vary wildly. A marketer working in Seoul, a self-employed shop owner in a small city, a parent on childcare leave, and a graduate student all read the same piece differently. Some bounce at the first paragraph; others read to the end and share.

A general LLM's answer flattens all of them into a single average. The average is convenient, but it hides how content actually spreads.

Content lives or dies in the tails of the distribution

Reach and virality are usually decided not by the average reaction but by the extremes — the tails of the distribution.

A small group that reacts strongly shares the content, and reach explodes.
A specific segment bails at the first sentence, and even a respectable average score can't save real reach.

"On average, not bad" misses both risks. What you actually need before publishing is "who reacts strongly, and who quietly leaves." That never comes from a single average evaluator.

The average gives you the illusion of safety. What actually makes or breaks content is the minority far from the mean.
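A small sketch with made-up numbers makes this concrete. The two score lists below are hypothetical, and the thresholds for a "strong" reaction (9 or above) and a "bail" (3 or below) are assumptions chosen only for illustration.

```python
from statistics import mean

# Hypothetical 1-10 scores from ten readers for two different drafts.
draft_a = [6, 6, 6, 6, 6, 6, 6, 6, 6, 6]
draft_b = [10, 10, 9, 7, 6, 6, 6, 2, 2, 2]


def tail_summary(scores, high=9, low=3):
    """Report the mean alongside the share of readers in each tail."""
    n = len(scores)
    return {
        "mean": mean(scores),
        "strong_share": sum(s >= high for s in scores) / n,  # likely sharers
        "bail_share": sum(s <= low for s in scores) / n,     # likely early exits
    }


a_summary = tail_summary(draft_a)
b_summary = tail_summary(draft_b)
print(a_summary)
print(b_summary)
```

Both drafts have a mean of 6, yet the second has 30% of readers in each tail while the first has none. An average-only evaluation cannot tell them apart.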

How to see the distribution, not the average

The fix is simple: instead of one imagined evaluator, build a crowd that mirrors the real population and let each member read.

Ilkim samples multiple synthetic personas that follow the population distribution from KOSIS (Statistics Korea). Even among "30-something women," occupation, region, and interests are spread out the way the statistics say they are. Each reads your draft from their own vantage point and returns completion/drop-off, a score, and a comment. The result isn't one number — it's a distribution of reactions.

This is built on NVIDIA's Nemotron-Personas-Korea dataset (CC BY 4.0) together with KOSIS distributions. "A statistically grounded crowd" rather than "one imagined person" is the decisive difference from a general LLM.
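The sampling idea can be sketched in a few lines of Python. This is a simplified illustration, not Ilkim's implementation: the weights below are invented placeholders rather than real KOSIS figures, the `sample_personas` helper is hypothetical, and occupation and region are drawn independently here, whereas a real sampler would presumably respect their joint distribution.

```python
import random
from collections import Counter

# Placeholder weights for illustration only; real values would come from
# published population statistics, not from this sketch.
OCCUPATION_WEIGHTS = {
    "office worker": 0.45,
    "self-employed": 0.15,
    "on childcare leave": 0.10,
    "graduate student": 0.05,
    "other": 0.25,
}
REGION_WEIGHTS = {
    "Seoul": 0.2,
    "wider capital area": 0.3,
    "other regions": 0.5,
}


def sample_personas(n, seed=0):
    """Draw n personas for one age/gender label, varying the other attributes."""
    rng = random.Random(seed)  # seeded so the same crowd can be reproduced

    def draw(weights):
        return rng.choices(list(weights), weights=list(weights.values()), k=1)[0]

    return [
        {
            "age_group": "30s",
            "gender": "female",
            "occupation": draw(OCCUPATION_WEIGHTS),
            "region": draw(REGION_WEIGHTS),
        }
        for _ in range(n)
    ]


personas = sample_personas(30)
print(Counter(p["occupation"] for p in personas))
```

Each sampled persona would then read the draft separately, so the 30 results reflect the spread of the population instead of one imagined reader.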

When you actually need this

Not every piece needs a distribution analysis. But in these situations, the opinion of one average persona is risky.

Broad-audience content — magazine articles or brand campaigns that must reach many kinds of readers.
Content where drop-off is fatal — landing copy or newsletters where first-paragraph bounce dictates conversion.
Content that's hard to fix after publishing — print, press releases, anything you can't quietly edit once it's out.

For pieces like these, it's safer to confirm "who reads it and how" before publishing — not just "it's fine on average."

In short: asking ChatGPT to role-play a specific reader is fast, but it collapses to one average person. Because reach is decided in the tails of the distribution, pre-publish validation should look at the reactions of a crowd that mirrors the real statistical distribution.
