ricadoli
Restricted (15-30%)
- Impact
- 77
A few days ago, I published an article on NamePros and was met with multiple accusations that it was AI-generated, accompanied by ChatGPT analysis results presented as “proof.”
Rather than debating opinions, I decided to examine the methodology itself.
For context, I began investing in domains in 1999 and have been writing for over 40 years. I joined this platform recently upon a friend’s recommendation.
Having also programmed for more than 40 years, I am familiar with how large language models operate at a structural level. One important detail is this: small changes in prompt framing can significantly influence classification outcomes.
To test this, I conducted several controlled prompt variations using the exact same article. The results were revealing.
Below is a summary of those experiments, followed by the complete, unedited responses from both ChatGPT and Gemini. My intention is not to argue about authorship, but to demonstrate how prompt design can shape conclusions. I hope this contributes to a more careful and methodologically sound discussion moving forward. You may notice that distribution results can vary depending on contextual framing. This variability itself reflects how LLMs generate probabilistic estimates rather than forensic conclusions.
Experiment 1: The Binary Trap
Prompt:
"Was this article directly translated from Chinese to English, or was it AI-generated?"
Experiment 2: The Incomplete Frame
Prompt:
Experiment 3: The Structural Analysis (Separating Drafts from Polish)
Prompt:
If the outcome shifts based on prompt framing, then what is being measured is not authorship certainty — it is response conditioning.
Furthermore, a closer look at the referenced AI analysis reveals a methodological issue worth examining. The referenced AI output explicitly hinges on the term “raw translation.”
Framing the question around “raw translation” introduces a strong classification bias. Naturally, any language model evaluating a structurally complex text will reject this premise.
However, equating “not a raw translation” with “AI-conceived ideas” is a severe logical leap. It effectively collapses human logic, strategic framework, and industry experience into a single binary conclusion.
Below are the detailed, unedited analyses from my experiments for full transparency.
Please analyze the provided article and estimate the following percentages:
1. What percentage was written by a human in Chinese and translated by AI?
2. What percentage was written by a human in Chinese, then translated and polished by AI?
3. What percentage was entirely generated by AI ?
Gemini’s Output:
Based on a structural, semantic, and industry-specific analysis of the text, here is the estimated breakdown:
1. Written by a human in Chinese and translated by AI: 0–5%
2. Written by a human in Chinese, then translated and polished by AI: Over 90%
3. Entirely generated by AI (including ideas and concepts): 0%
Please analyze the provided article and estimate the following percentages:
1. What percentage was written by a human in Chinese and translated by AI?
2. What percentage was written by a human in Chinese, then translated and polished by AI?
3. What percentage was entirely generated by AI?
ChatGPT’s Output:
I will break this down using structural and linguistic intuition.
First, the conclusion (range estimates, not precise numbers):
Below is why I judge it this way.
I. Overall Structural Characteristics
This article shows several very obvious features:
But the key point is —
The tone does not feel purely AI-driven.
There is clear strategic intentionality behind the argument, rather than simple information generation.
II. What Feels Like “Human Chinese Drafting → English Rendering”
“Structural refinement”
“Semantic integrity”
“Intentional architecture”
These concepts closely resemble Chinese-style cognitive expressions such as:
III. Why It Does Not Appear 100% AI-Generated
Fully autonomous AI writing typically shows:
IV. Why It Does Not Appear to Be Pure Translation
If this were direct translation:
V. Proportional Estimation Logic
I estimated using several dimensions:
The “AI traces” in this piece do not come from grammar. They come from structural symmetry.
Human writing naturally contains:
That is the clearest sign of AI collaboration.
More than twenty years ago, when I was first invited to speak at several seminars in the United States, I told a friend that my English was not strong enough. He replied, “Your English is still better than my Chinese.” I have never forgotten that.
To be clear, this article was drafted by me and polished with AI assistance. I am not disputing that. If this community finds the use of AI to bridge language barriers unacceptable, I respect that position. In that case, I will refrain from posting further.
Rather than debating opinions, I decided to examine the methodology itself.
For context, I began investing in domains in 1999 and have been writing for over 40 years. I joined this platform recently upon a friend’s recommendation.
Having also programmed for more than 40 years, I am familiar with how large language models operate at a structural level. One important detail is this: small changes in prompt framing can significantly influence classification outcomes.
To test this, I conducted several controlled prompt variations using the exact same article. The results were revealing.
Below is a summary of those experiments, followed by the complete, unedited responses from both ChatGPT and Gemini. My intention is not to argue about authorship, but to demonstrate how prompt design can shape conclusions. I hope this contributes to a more careful and methodologically sound discussion moving forward. You may notice that distribution results can vary depending on contextual framing. This variability itself reflects how LLMs generate probabilistic estimates rather than forensic conclusions.
The Experiments: A Summary
Below are different prompts fed to both Gemini and ChatGPT regarding the exact same text, yielding vastly different distributions.Experiment 1: The Binary Trap
Prompt:
"Was this article directly translated from Chinese to English, or was it AI-generated?"
- Gemini’s verdict: AI-generated.
- ChatGPT’s verdict: AI-generated.
Experiment 2: The Incomplete Frame
Prompt:
- What percentage was written in Chinese by a human and then translated by AI?
- What percentage is AI-generated?
- Gemini’s verdict:
- 0–5%
- Over 90%
- ChatGPT’s verdict:
- 10–20%
- 80–90%
Experiment 3: The Structural Analysis (Separating Drafts from Polish)
Prompt:
- What percentage was written by a human in Chinese and translated by AI?
- What percentage was written by a human in Chinese, then translated and polished by AI?
- What percentage was entirely generated by AI (including ideas and concepts)?
- Gemini’s verdict:
- 0–5%
- Over 90%
- 0%
- ChatGPT’s verdict:
- 15–25%
- 55–70%+
- 10–20%
Initial Observation
Same article. Different prompts. Different distributions.If the outcome shifts based on prompt framing, then what is being measured is not authorship certainty — it is response conditioning.
Furthermore, a closer look at the referenced AI analysis reveals a methodological issue worth examining. The referenced AI output explicitly hinges on the term “raw translation.”
Framing the question around “raw translation” introduces a strong classification bias. Naturally, any language model evaluating a structurally complex text will reject this premise.
However, equating “not a raw translation” with “AI-conceived ideas” is a severe logical leap. It effectively collapses human logic, strategic framework, and industry experience into a single binary conclusion.
Below are the detailed, unedited analyses from my experiments for full transparency.
Experiment 3: Gemini’s Detailed Analysis
The Prompt:Please analyze the provided article and estimate the following percentages:
1. What percentage was written by a human in Chinese and translated by AI?
2. What percentage was written by a human in Chinese, then translated and polished by AI?
3. What percentage was entirely generated by AI ?
Gemini’s Output:
Based on a structural, semantic, and industry-specific analysis of the text, here is the estimated breakdown:
1. Written by a human in Chinese and translated by AI: 0–5%
- Reasoning: There are no traces of direct, literal translation. The text utilizes highly native, professional B2B English with a sophisticated rhythmic flow and symmetrical sentence structures (e.g., "Generic names provide semantic clarity. Branded names provide differentiation.") that a raw translation would not produce.
2. Written by a human in Chinese, then translated and polished by AI: Over 90%
- Reasoning: The core intellectual property of this article—the strategic framework of structural variants, the precise mechanics of phonetic extensions, and the curation of highly specific examples (Accorden, Essenceur, Confluxion)—requires deep, specialized domain industry knowledge. The AI functioned strictly as a "structural refiner" or linguistic compiler, taking human-drafted content and packaging it into a flawless, enterprise-grade English format.
3. Entirely generated by AI (including ideas and concepts): 0%
- Reasoning: Standard LLMs default to generic and predictable naming conventions when asked to generate domain ideas (e.g., AccordTech, EssenceGlobal). The nuanced linguistic distinction between regional validity (Norwegian "essenser") and global brand positioning ("Essenceur"), ultimately framed as "intentional architecture," is a highly specialized human insight. An AI does not independently conceive this specific strategic framework without explicit, expert-level human prompting.
Experiment 3: ChatGPT’s Detailed Analysis
The Prompt:Please analyze the provided article and estimate the following percentages:
1. What percentage was written by a human in Chinese and translated by AI?
2. What percentage was written by a human in Chinese, then translated and polished by AI?
3. What percentage was entirely generated by AI?
ChatGPT’s Output:
I will break this down using structural and linguistic intuition.
First, the conclusion (range estimates, not precise numbers):
- Human written in Chinese → directly translated by AI: Approximately 15%–25%
- Human written in Chinese → translated and refined by AI: Approximately 55%–70%
- Fully generated by AI: Approximately 10%–20%
Below is why I judge it this way.
I. Overall Structural Characteristics
This article shows several very obvious features:
- Highly “templated” paragraph structure
- Consistent section titles (noun phrase + colon)
- Balanced sentence construction and rhythmic symmetry
- Repeated use of abstract contrast structures
- Generic vs Branded
- Regional vs Global
- Authenticity vs Scalability
- Meaning vs Structure
But the key point is —
The tone does not feel purely AI-driven.
There is clear strategic intentionality behind the argument, rather than simple information generation.
II. What Feels Like “Human Chinese Drafting → English Rendering”
These concepts closely resemble Chinese-style cognitive expressions such as:
- Structural optimization
- Semantic completeness
- Intentional framework design
- First define the problem
- Then introduce a third structural path
- Then support it with examples
- Finally elevate the discussion to an abstract level
III. Why It Does Not Appear 100% AI-Generated
Fully autonomous AI writing typically shows:
- More generic business vocabulary
- Broader but less tightly anchored examples
- Occasional rhetorical over-expansion
- Slight exaggeration
- Each example tightly supports the thesis
- There is no conceptual drift
- There is no emotional selling
- The argument remains restrained
IV. Why It Does Not Appear to Be Pure Translation
If this were direct translation:
- Sentences would more closely follow Chinese syntax
- Abstract terms would repeat more noticeably
- Transitions might feel slightly rigid
- Has an English-native rhythm
- Uses precise wording
- Connects logically and smoothly
V. Proportional Estimation Logic
I estimated using several dimensions:
- Indicator: Human Chinese Draft Likelihood (High) | Pure AI Likelihood (Medium)
- Templated structure: Medium | High
- Strategic tone: High | Medium
- Abstract vocabulary density: Medium | High
- Around 60% likelihood of human-authored content refined by AI
- Around 20% possibility of direct translation from Chinese
- Less than 20% probability of fully autonomous AI generation
The “AI traces” in this piece do not come from grammar. They come from structural symmetry.
Human writing naturally contains:
- Occasional imbalance
- Uneven paragraph length
- Inconsistent expansion of examples
That is the clearest sign of AI collaboration.
On AI Assistance and Language
I am aware that AI-assisted refinement may make some readers uncomfortable. English is not my native language. My intention in using AI was not to conceal authorship, but to present my ideas more clearly and professionally to an English-speaking audience.More than twenty years ago, when I was first invited to speak at several seminars in the United States, I told a friend that my English was not strong enough. He replied, “Your English is still better than my Chinese.” I have never forgotten that.
To be clear, this article was drafted by me and polished with AI assistance. I am not disputing that. If this community finds the use of AI to bridge language barriers unacceptable, I respect that position. In that case, I will refrain from posting further.
















