Top
New
🌕
convexstrictly
joined
3/26/2023, 3:26:25 AM
has
1109
karma
Recent Posts
Gemini Flash 2.0 Thinking Experimental
by
convexstrictly
on 12/19/2024, 4:44:19 PM with
3
comments
What Questions Are in the Chinese College Entrance Exam?
by
convexstrictly
on 6/14/2024, 11:46:26 AM with
0
comments
Generative AI Is Not Going to Build Your Engineering Team for You
by
convexstrictly
on 6/14/2024, 11:42:40 AM with
0
comments
Building GPT2o – Part 1: Audio
by
convexstrictly
on 6/14/2024, 11:26:42 AM with
1
comment
The Geometry of Categorical and Hierarchical Concepts in Large Language Models
by
convexstrictly
on 6/8/2024, 9:49:00 PM with
0
comments
OpenAI says it has begun training a new flagship A.I. model
by
convexstrictly
on 5/28/2024, 10:47:25 AM with
0
comments
California residents: call your legislators about AI bill SB 1047
by
convexstrictly
on 5/20/2024, 11:38:04 PM with
5
comments
LISA: Layerwise Importance Sampling for Memory-Efficient LLM Fine-Tuning
by
convexstrictly
on 3/27/2024, 5:17:15 PM with
1
comment
NTIA AI Open Model Weights RFC
by
convexstrictly
on 3/27/2024, 12:29:14 AM with
1
comment
Mechanics of Next Token Prediction with Self-Attention
by
convexstrictly
on 3/19/2024, 3:11:11 PM with
0
comments
Dive Deeper into Yi-9B
by
convexstrictly
on 3/18/2024, 9:53:15 PM with
0
comments
You can now train a 70B language model at home
by
convexstrictly
on 3/6/2024, 6:51:47 PM with
1
comment
Shape Suffixes – Good Coding Style
by
convexstrictly
on 2/28/2024, 1:50:27 PM with
0
comments
Star Trek prompt optimal for grade school math on Llama-70B
by
convexstrictly
on 2/26/2024, 3:28:33 PM with
1
comment
(US Dept of Commerce) NTIA Solicits Comments on Open-Weight AI Models
by
convexstrictly
on 2/23/2024, 9:46:24 PM with
0
comments
BitDelta: Your Fine-Tune May Only Be Worth One Bit
by
convexstrictly
on 2/16/2024, 4:57:31 PM with
2
comments
Time is encoded in the weights of finetuned language models
by
convexstrictly
on 12/24/2023, 9:53:03 PM with
10
comments
Zoology 1: Measuring and Improving Recall in Efficient Language Models
by
convexstrictly
on 12/22/2023, 4:00:41 PM with
1
comment
TinyGSM: Achieving >80% on GSM8k with small language models
by
convexstrictly
on 12/15/2023, 2:44:33 AM with
1
comment
Androids built to meet the labor demands
by
convexstrictly
on 12/1/2023, 4:16:36 PM with
1
comment
Sam Altman likely to start company with researchers from OpenAI: Bloomberg
by
convexstrictly
on 11/18/2023, 7:15:41 PM with
2
comments