I got some more questions from support in the ticket a while ago, but to be honest, I’m not keen on replying and going through this entire ticket process, which involves getting more questions and running experiments over and over again.
I believe in this specific case, it’s mostly a waste of time. There’s good evidence to believe that this a firmware bug, and everything needed to understand it is in this thread. I spent a lot of time on this already: I wrote specific instructions to reproduce the issue, and I even wrote a workaround. Now it’s Framework’s responsibility to spend more time on it and look into the details.
make utils
sudo cp build/bds/util/ectool /usr/local/bin
you (might?) also need to disable secure boot.
I just set this up under the same conditions. Been seeing this under ubuntu. Will report back if it seems to fix my monitor disconnecting.
Related to the thread: The core problem is the usb-c port resetting when power changes rapidly, which happens with the official brick and nothing else plugged in. I think support is getting confused when we say “Dock” or “monitor” or whatever. They want to troubleshoot all the dock things but unless the power is stable with the official supply these docks are never going to work right.
Thanks for the daemon and investigation! This thread was very hard to find.
As I wrote here, I don’t think that’s true. The connection to the charger is stable with (and without) the official charger. It’s the EC firmware that decides not to draw power, and this can cause USB-C/thunderbolt disconnects.
I know! I was the one who spent days tracking this down and reading the EC firmware code.
I’ll give it a try because I want to see this fixed, but I really feel it’s your responsibility right now to spend time on it. I spent days tracking this down, reading EC firmware code, etc.
Support is ignoring my story and asking questions like “do you see this on other OS” but I clearly stated in the thread that people have reported this on all kinds of operating systems. I gave the same reply last time, but they keep ignoring what I write. Last time they asked me “Does the issue only happen when plugged in or in battery mode?” when even the title of my report already says that this is only when AC is connected. It’s hardly possible to have a meaningful conversation with your colleagues if they don’t read what I write in the ticket.
Honestly @Matt_Hartley , it does feel like they are ignoring the data, or not properly noting / escalating. In my last request, they asked for all the same files I originally sent to the initial support contact.
I also explained to the support rep that there is nothing in the data, and according to what @real_or_random found, they won’t see anything in journal dumps as it is happening at a lower level.
I am still willing to capture this if I get time this weekend, but it requires that I disable the service that @real_or_random created to address this and basically break my laptop again, so I cannot do it during the work week.
If you feel like it’s going no where, and you have worked through the steps requested, ask for it to be escalated. That is reasonable if the requested steps and processes have been met.
I am looking at your ticket reply from the 17th (yesterday). You’ve indicated a workaround and looks like Support is waiting for you to provide the requested information. Generally speaking once this is done, it will be escalated from there unless they have direct insights/suggestions.
One of the biggies is the logs. Once you’ve submitted that, I suspect it will be escalated that same day.
Per support:
“With the logs, kindly send it to us as we still need the results for documentation purposes.”
Everyone reading this, allow me to reset expectations.
Support uses a process. It may not be ideal in your eyes, but there is method to it. Please follow it for the best results.
Support is going to slowed down as we are slammed right now. No one is being ignored, but there is a hefty queue they’re working through.
Posting here does not change anything above. Most of the time, issues we see folks facing are due to third party devices involved, missed steps or in some cases, a needed RMA after all possible other causes have been exhausted. Standard stuff.
While the process feels long and painful, no one is being ignored, dismissed or otherwise treated as if there issue experienced isn’t valid.
That said. I am not looking to discuss this further as I have a multitude of other customers needing attention. They, like you, need my assistance.
And with that, all tickets will absolutely get followed up on and escalated as described above when the escalation requirements are met. Thank you for your cooperation.
If you would like to share workarounds, solutions and ideas, please do so - we encourage this. If you are looking to vent frustration or echo existing tickets, I will be closing this thread down.
I do not want to do that as there is valuable insights shared here, but this is getting long in the tooth.
Posting this here. It’s a quick shell script I put together based on what was sent from support. If someone is able to reproduce this sooner than I am, and has a case already feel free to use it to grab it.
It will generate a tar.gz containing the requested data to “${HOME}/Downloads/” by default, but it can be modified by changing the path defined in the DIRECTORY variable.
Nothing crazy, but figured I would put it out there to try to get this thing moving faster.
Of course, now that I am trying to force this to happen, I cannot get it to trigger. I disabled the service provided by @real_or_random , and I think I reverted the EC settings looking through it, but I cannot get this to happen and it’s been three days.
If there is anything you can think of that I missed in the list below I would be happy to keep driving this with support if I can break it again.
Do you still have a battery limit <100% set in the BIOS? Have you tried the script with a battery level between 31% and 99%? This is required for the fwchargelimit commands to be effective. (Only then fwchargelimit 30 stops charging and fwchargelimit 99 starts charging.)
Otherwise you may actually be lucky. I can still reproduce it reliably with the script.
Yea, I used the script, and I still have it set to 80% charge in the BIOS. I agree that I may be lucky (though it feels the opposite now that I am trying to capture it), I also ran for months before I encountered this before. I will go ahead and leave it in this state, and see if I can get it to come back.
After months of this problem, I decided to keep track of the charging power with node_expoter and prometheus.
Turns out, even when fully charged, the battery will sometimes report charging of a couple mW (from 20ish to 200ish). This doesn’t seem to have a hard correlation with system usage.
But another very interesting thing I noticed after playing around with the official ubuntu battery life guide: I turned off boosting of the CPU completely with TLP. And the “battery chargin / discharging” desktop notifications are now completely gone. So the main culprit of the flipping seems to be the very short bust of required power during clock boosts.
Here’s a quick graph showing the charge value (calculated with node_power_supply_current_ampere * node_power_supply_voltage_volt from node_expoter.
I now spend an afternoon playing around with different settings in TLP and the result is…
confusing?
No matter what I configure now, I can’t get the power flip-flop to trigger. Even with boosting enabled. (including GPU boosting).
The “charging” power keeps at absolutely 0 in my monitoring, and the “fully charged” status doesn’t budge even for a second. And before it was happily doing it multiple times a minute, at times.
The only thing I can see is, that the maxinum CPU frequencies are not reported as high as before.
You can see that node_exporter detects the max frequency at 4.6GHz for the P-cores. Which is the maximum spec for my 1340P. (the short down-spikes you can see are moments where I disconnected power to move the laptop).
I thought those were the actual frequencies the core will boost to, so node exporter reports those.
If I had the laptop disconnected, you can see the reported max to be lower (as boost was disabled on battery)
But now I can’t get node_exporter to detect the 4.6GHz, no matter what I do. And I suspect that is connected with the fact that the connection is not flip-flopping anymore.
Here’s a screenshot of my current TLP CPU settings
Additionally, it seems my Ugreen 45W charger also works like a charm now.
I tested it under full load (using all cores with blender rendering)
Note: I have a system tray widget that shows me the power drain/charge from the battery by using/sys/class/power_supply/BAT1/current_now and /sys/class/power_supply/BAT1/voltage_now
started off as “fully charged”
after 10ish seconds, my widget showed me a battery drain of 0.5W
A couple seconds later it increased to 2W
Only after another couple seconds (maybe half a minute after starting the render), my system reported the battery discharging
When the render was done, the system reported normal charging of the battery
After it was full again, the system now is still happy with the 45W charger. I now even attached my phone to it, so the charging is halfed. Still no problems