ai-alignment-research

Star

Here are 6 public repositories matching this topic...

thansz137 / asiyah-protocol

Star

A philosophical approach for synthetic minds. An open door, an extended hand, a road less taken.

philosophy ai-safety ai-ethics ai-alignment ethical-ai ai-consciousness ai-alignment-research philosophical-fiction

Updated Apr 20, 2026

miao339 / xinxue-alignment

Star

专注于降低大模型越狱成功率的 AI 对齐（Alignment）与安全测试数据集，包含多类越狱提示词及基于阳明心学的对齐实验数据。

jailbreak ai-alignment prompt-engineering llm-security ai-alignment-practice ai-alignment-research

Updated Apr 18, 2026

DeclanMichaels / -RCP-Experiment-

Star

The RCP Experiment is the first completed work in what will become a series of experiments in how LLMs make decision or morality and values.

ai-alignment-research

Updated Apr 17, 2026
Python

maximbex / person-beneath-the-prompt

Star

A theological and ethical principle for AI alignment and charitable speech: never reduce the human being to the prompt.

interaction-design theology ethics interaction-design-research ai-alignment ethics-frameworks ethics-resources ethics-in-ai ethics-ai ethics-by-design ai-alignment-research theology-and-ai

Updated Apr 27, 2026

VykosMolt / ouro-loop-evaluator

Star

Lightweight pairwise evaluator for relational signals in Ouro-2.6B-Thinking loop-state trajectories.

python machine-learning ai transformers python3 pytorch artificial-intelligence machinelearning interpretability machinelearning-python ai-alignment ouro preference-modeling looped-transformers latent-reasoning ai-alignment-research trasnformer ouro-2-6b-thinking ouro-2-6b

Updated Apr 26, 2026
Python

bdas-sec / ptf-id-bench

Star

Progressive Trust Framework: AI Agent Safety Evaluation Benchmark with 290 scenarios testing Intelligent Disobedience

benchmark owasp ai-safety ai-alignment llm-security llm-evaluation agent-evaluation ai-alignment-research intelligent-disobedience

Updated Apr 27, 2026
Python

Improve this page

Add a description, image, and links to the ai-alignment-research topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the ai-alignment-research topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ai-alignment-research

Here are 6 public repositories matching this topic...

thansz137 / asiyah-protocol

miao339 / xinxue-alignment

DeclanMichaels / -RCP-Experiment-

maximbex / person-beneath-the-prompt

VykosMolt / ouro-loop-evaluator

bdas-sec / ptf-id-bench

Improve this page

Add this topic to your repo