Re: [802.3_ISAAC] [EXTERNAL] Re: [802.3_ISAAC] Real live example of why

Thread Links	Date Links
Thread Prev	Thread Next	Thread Index	Date Prev	Date Next	Date Index

Hi TJ Houck

I don't think your explanation is wrong, and I agree that you are talking about an important topic for 802.3dm.

I just don't think the Tesla recall and 802.3dm have much to do with it.

I'll also share my thoughts on the booting you mentioned, but please understand that there are many OEMs and they have different system implementations, and my company has different implementations, so please take my story as an example and not representative of anything.

I think you should let the deserilizer system boot before the camera with serilizer. A system with a deserilizer, for example, the head unit, is pre-booted by the vehicle access or intrusion detection system. Once booted, it is expected that the cameras will be powered up (e.g. via power over coax) and booted in the same order.

The camera may be powered after the headunit has booted up, or the headunit may decide to do so. It depends on the features you want to implement in your system. If the headunit decides to power the camera, the DES side will start when the headunit has booted up. If the Headunit AP and the camera are powered together, the boot time of the Headunit AP will be much larger than what we are discussing and normal video transmission will not occur.

I apologize if I am understanding the problem incorrectly.

Also, I didn't mean to imply that what you said about latency was wrong, my English is not good enough and I apologize for misunderstanding.

Thanks for your contribution

Best Regards

Dongok Kim

From: TJ Houck <thouck@xxxxxxxxxxx>
Sent: Friday, October 4, 2024 9:20 PM
To: STDS-802-3-ISAAC@xxxxxxxxxxxxxxxxx
Subject: Re: [802.3_ISAAC] [EXTERNAL] Re: [802.3_ISAAC] Real live example of why latency maters

Hi Dongok and Scott,

@김동옥 책임연구원 전자선행설계팀 – Tesla has been using GMSL and FPD-link for over a decade now for their camera feeds. They have FSD and camera teardowns you can publicly find or pay for online to prove this. How do systems boot today for the camera feed? The diagram in slide 8 shows how the latency numbers are achieved minus the processing information on the SoC. The diagram I put together is very similar to the ones many Tier 1s and OEMs use today to estimate the impact on time to boot. Can you explain why you don’t think what I showed relates to how boot latency times are explicitly calculated for the link we provide?

@Scott.Muma@xxxxxxxxxxxxx: I assume you did not attend the Hamburg conference virtually as all your questions were answered to the scenario you mentioned. Several people talked about byte-wise vs bulk transfer and its pros and cons. Slide 9 in my presentation shows how updates occur to camera sensors, making bulk transfer difficult due to how the data is processed in some scenarios. Several of the cons we discussed of bulk transfer will need to be addressed. On top of that, 1722 I2C bulk transfer has yet to be released as a standard and has not been implemented in any production vehicles today. As we all know, automotive customers take time to adopt, especially for safety-critical applications. This is why minimizing changes to the system is essential. I showed byte-wise because this is what is in production vehicles today, and I outlined several reasons for this while giving my presentation. Having both modes and using bulk for initialization could be beneficial once it gets proven out – because I outlined that there are cons to bulk transfers. I would also be interested in how you concluded: “The 1 second impact TJ calculated on slide 8 seems to be unnecessarily pessimistic.” The scenario I outlined is a real case scenario I came across in my career that I shared with the group. I mentioned during the presentation imagers can exceed this number of registers.

Best Regards,

TJ Houck

From: 김동옥 책임연구원 전자선행설계팀 <DOKIM@xxxxxxxxxxx>
Sent: Friday, October 4, 2024 2:00 AM
To: STDS-802-3-ISAAC@xxxxxxxxxxxxxxxxx
Subject: Re: [802.3_ISAAC] [EXTERNAL] Re: [802.3_ISAAC] Real live example of why latency maters

Dear ALL tp-111-v01-final_tag. pdf (nhtsa. gov) , FMVSS Does this have anything to do with the latency that we are discussing in the FMVSS? I don't think so. The flow of the mail could be misinterpreted as a signal that a back screen of 2sec or

Dear ALL

tp-111-v01-final_tag.pdf (nhtsa.gov) , FMVSS

Does this have anything to do with the latency that we are discussing in the FMVSS? I don't think so. The flow of the mail could be misinterpreted as a signal that a back screen of 2sec or less is acceptable. In any case, 'no data' from the camera is not allowed.

Is Tesla's recall relevant to this discussion? Tesla probably didn't use an LVDS camera, which would have met the latency requirements we're talking about, so why did it happen?

Thanks for your contribution

Best Regards

Dongok Kim

From: Ragnar Jonsson <rjonsson@xxxxxxxxxxx>
Sent: Friday, October 4, 2024 9:40 AM
To: STDS-802-3-ISAAC@xxxxxxxxxxxxxxxxx
Subject: Re: [802.3_ISAAC] [EXTERNAL] Re: [802.3_ISAAC] Real live example of why latency maters

Hi Scott,

All I wanted to communicate with my initial email on this subject is that latency is important.

If you have information that contradict TJ’s analysis, then I would encourage you to bring in concreate examples about current implementations that dispute his analysis. TJ’s analysis is not saying that it is not possible to do better, but rather analyzing what is common practice today.

Regarding your statement “so other than reminding us of the 2 second limit from backup event to image display it doesn’t seem to relate to 802.3dm”, I agree with you. However, the 2 second limit is clearly very important and very relevant to 802.3dm. This is what I wanted to communicate by sharing this link. This is “Real live example of why latency maters” (hence the subject matter for my email).

Ragnar

From: Scott.Muma@xxxxxxxxxxxxx <Scott.Muma@xxxxxxxxxxxxx>
Sent: Thursday, October 3, 2024 4:45 PM
To: Ragnar Jonsson <rjonsson@xxxxxxxxxxx>; STDS-802-3-ISAAC@xxxxxxxxxxxxxxxxx
Subject: RE: [802.3_ISAAC] [EXTERNAL] Re: [802.3_ISAAC] Real live example of why latency maters

Hi Ragnar, I suspect that it’s just very confusing to talk about PHY latency and system level event-to-result latency as both being “latency”. I agree that PHY latency matters up to a point, but this recall doesn’t provide us with any convincing

Hi Ragnar,

I suspect that it’s just very confusing to talk about PHY latency and system level event-to-result latency as both being “latency”.

I agree that PHY latency matters up to a point, but this recall doesn’t provide us with any convincing information about PHY latency. It seems very unlikely that the recall was addressed by reducing the PHY latency, so other than reminding us of the 2 second limit from backup event to image display it doesn’t seem to relate to 802.3dm.

From TJ’s presentation I was able to understand that byte-mode I2C transfers that wait for the far-end to send an ACK for each byte are not appropriate in a multi-hop network and will break the system performance as soon as they are extended beyond the most trivial point-to-point case. The link initialization case of needing to send 5000 register accesses can be addressed in multiple different ways though to take advantage of the network bandwidth without suffering the full end-to-end latency on every byte transferred.

Slide 8 shows there are 5000 commands at 1Mbps taking 140ms, so we can infer there are approximately 5000 bytes of data, and 10000 bytes of address transferred. If the 802.3dm upstream link bandwidth is 100Mbps and even if each command (address+data) goes into its own 64-Byte packet, it should take less than 35ms of link time to transfer all of these packets. Or put another way, a single I2C command will take about 28us, but it will only take 6.7us to send this packet on the upstream link on average. The link will mostly be waiting for the I2C. TJ’s Slide 8 completely ignores any concurrency in this process though which does not seem realistic… the I2C transactions from the SoC to the switch and the commands at the sensor PHY to the imager will also be mostly concurrent with packets in flight. So rather than a total of 830ms a more realistic number here would seem to be link-up time + I2C time + PHY latency = 50ms+140ms+~(PHY latency). So with PHY latencies being discussed of <100us the impact of PHY latency on system initialization will be negligible as long as the system doesn’t insist on receiving a far-end response/ACK back for every I2C command before proceeding with the next command.

Another way to say this would be that in an always-transmitting upstream link it is mostly waiting for the next I2C command. For a TDD link it may buffer a few commands while waiting for a transmit opportunity, and then send a burst of 3-4 command frames at which point it is caught up and waiting on I2C again. In either case, if the higher level protocol allows, multiple I2C commands could be combined into one frame to reduce overhead and conserve bandwidth and reduce contention. By choosing an appropriate I2C handling method the system initialization time can be determined by the I2C interface bandwidth, and be independent of the PHY latency.

In summary, I agree that any of the PHY approaches discussed so far would let 802.3dm achieve acceptable PHY latency and quite good system performance/initialization time, even meeting the 300ms target mentioned. The 1 second impact TJ calculated on slide 8 seems to be unnecessarily pessimistic.

Best Regards,

Scott

From: Ragnar Jonsson <rjonsson@xxxxxxxxxxx>
Sent: Thursday, October 3, 2024 3:41 PM
To: STDS-802-3-ISAAC@xxxxxxxxxxxxxxxxx
Subject: Re: [802.3_ISAAC] [EXTERNAL] Re: [802.3_ISAAC] Real live example of why latency maters

EXTERNAL EMAIL: Do not click links or open attachments unless you know the content is safe

Hi Mehmet,

It is good to see that we agree that latency maters. That was all I wanted to communicate with my original email on this thread.

We may still not be completely aligned on what the latency targets should be, or how important latency is. The short latencies will accumulate and will significantly impact the startup time budget, like Amir explains in this thread and TJ explained on SLIDE 8 of his presentation https://www.ieee802.org/3/dm/public/0924/Houck_Fuller_3dm_03_0917.pdf

Looking more closely at the email thread below, I realize that you may have interpreted my comments to imply that latency is the only thing that matters, which is not at all what I meant. You say below:

“Yes, the latency is important but we should not narrow down the solution space based on microsecond latency difference constraint. Instead, we should keep looking at the big picture and find the best possible solution for 802.3dm.”

I agree with you 100% on this. In fact, I keep pointing out that both ACT and TDD can meet the latency requirements suggested by TJ. If you look at my comparison of ACT and ASA-MLE you will see that I do not mention latency at all in my comparison (see https://www.ieee802.org/3/dm/public/0924/jonsson_razavi_3dm_01_09_15_24.pdf). This is because 2.5Gbps and 5Gbps ASA-MLE can meet the latency suggested by TJ. The 10Gbps ASA-MLE has too high latency, but this can easily be fixed.

Ragnar

From: Amir Bar-Niv <abarniv@xxxxxxxxxxx>
Sent: Thursday, October 3, 2024 2:00 PM
To: STDS-802-3-ISAAC@xxxxxxxxxxxxxxxxx
Subject: Re: [802.3_ISAAC] [EXTERNAL] Re: [802.3_ISAAC] Real live example of why latency maters

Mehmet,

The PHY latency of microseconds creates a huge difference in the system performance/latency.

In the use-case that TJ described in slide 8 in https://www.ieee802.org/3/dm/public/0924/Houck_Fuller_3dm_03_0917.pdf (which is applicable to the live example that Ragnar shared below), there is a very big difference between the “12us” proposed by TJ, compare to the “100us” proposed in https://www.ieee802.org/3/dm/public/0724/matheus_dm_02b_latency_07152024.pdf (slide 9): This is the difference between 60ms (based on 12us) to 500ms (based on 100us), with the example of 5000 commands initialization process.

This is almost 0.5 second that you eat from the “2 seconds” target of NHTSA. This is 25% of the budget. So Yes, few tens of microseconds difference in PHY latency makes a huge difference in overall system performance and the ability to meet the NHTSA requirements.

Thanks,

Amir

From: Mehmet Tazebay <00002b0d1a6e9d74-dmarc-request@xxxxxxxxxxxxxxxxx>
Sent: Thursday, October 3, 2024 1:24 PM
To: STDS-802-3-ISAAC@xxxxxxxxxxxxxxxxx
Subject: [EXTERNAL] Re: [802.3_ISAAC] Real live example of why latency maters

Hi Ragnar, There is no disagreement that the latency matters. We are in agreement there. However, the disconnect still stands for 1. The 8 seconds application latency failure in the news that you’ve quoted (where the OEM failed 2 seconds requirement

Hi Ragnar,

There is no disagreement that the latency matters. We are in agreement there.

However, the disconnect still stands for

1. The 8 seconds application latency failure in the news that you’ve quoted (where the OEM failed 2 seconds requirement hence causing the recall)

2. The real applications have a built-in ~50 milliseconds latency (as presented in Hamburg and even before)

3. While we are dwelling on the impact of the PHY latency (in the order of tens of microseconds!) on #1 and #2 which are in seconds.

The key takeaway for me, yes, the latency matters but the impact of the PHY latency is really minuscule compared to other big ticket items as described before by the others in the group.

Would you not agree?

Mehmet

From: Ragnar Jonsson <rjonsson@xxxxxxxxxxx>
Sent: Thursday, October 3, 2024 11:14 AM
To: Mehmet Tazebay <mehmet.tazebay@xxxxxxxxxxxx>; STDS-802-3-ISAAC@xxxxxxxxxxxxxxxxx
Subject: RE: [EXTERNAL] Re: [802.3_ISAAC] Real live example of why latency maters

Hi Mehmet,

The direct relationship with our discussion is to slide 8 of TJ’s presentation in Hamburg: https://www.ieee802.org/3/dm/public/0924/Houck_Fuller_3dm_03_0917.pdf

This slide talks explicitly about the 2 second requirement and how it relates to link latency.

The takeaway is that latency matters.

Ragnar

On Oct 3, 2024, at 1:50 PM, Mehmet Tazebay <000007de4eafb912-dmarc-request@xxxxxxxxxxxxxxxxx> wrote:

Ragnar,

Thank you for bringing this piece of news to our attention but I do not follow how 8 seconds violation relates to the topic of microseconds PHY latency discussion here. This particular OEM is violating by seconds and that is a very big issue but we are discussing microseconds for the PHY here!

All,

Just to put everything in context, even though we are trying to be conscious of latency impact -as we should be-, here we are discussing PHY latency, for example, between ~5 microseconds versus ~25 microseconds. On the other hand, the real application has about 50 milliseconds latency according to the data that has been shown by Gollob, Matheus, and others. From what I understand this is coming from a real application and that is a lot more than the PHY latency!

So, what does this all mean? Yes, the latency is important but we should not narrow down the solution space based on microsecond latency difference constraint. Instead, we should keep looking at the big picture and find the best possible solution for 802.3dm.

Regards,

Mehmet

On Oct 3, 2024, at 11:25 AM, Ragnar Jonsson <rjonsson@xxxxxxxxxxx> wrote:

All,

We have been having considerable discussion about latency, and how important it is. In case anyone was only thinking of this as an academic problem, I urge you to read about the recall of 27,000 Cybertruks because of latency issue with the rearview camera. According to https://www.cnn.com/2024/10/03/business/tesla-cybertruck-recall-october-2024/index.html:

“The rearview display might appear blank for up to 8 seconds when the Cybertruck is put in reverse, according to a filing from the National Highway Traffic Safety Administration (NHTSA). That’s well beyond the 2 seconds required by US federal safety rules.”

I think that this is a very strong argument in favor of what TJ has been preaching on latency. This relates directly to slide 8 of TJ’s presentation in Hamburg: https://www.ieee802.org/3/dm/public/0924/Houck_Fuller_3dm_03_0917.pdf

I also like to remind everyone of Ariel’s comment in Hamburg, where he pointed out that the 2 seconds are for the whole system, so the camera needs to be up and running in much shorter time.

The bottom line is this: Latency matters.

Ragnar

To unsubscribe from the STDS-802-3-ISAAC list, click the following link: https://listserv.ieee.org/cgi-bin/wa?SUBED1=STDS-802-3-ISAAC&A=1